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We introduce and explore a particular approach to the analysis of autonomous, Pfaf- 



cal properties of autonomous, Pfaffian non-holonomic systems in classical mechanics. The 
method is based on the construction of a certain auxiliary constrained Hamiltonian system, 
which comprises the non-holonomic mechanical system as a dynamical subsystem on an 
invariant manifold. The embedding system possesses a completely natural structure in the 
context of symplectic geometry, and using it in order to understand properties of the subsys- 
tem has compelling advantages. We discuss generic geometric and topological properties of 
the critical sets of both embedding and physical system, using Conley-Zehnder theory and 
by relating the Morse- Witten complexes of the 'free' and constrained system to one another. 
04 ■ Furthermore, we give a qualitative discussion of the stability of motion in the vicinity of 

^ I the critical set. We point out key relations to sub-Riemannian geometry, and a potential 

computational application. 

b ■ 

O ■ Introduction 
m ■ 
O 

Plc fian non-holonomic systems in classical mechanics, which renders them naturally accessible 

I to the methods of symplectic and sub-Riemannian geometry. We note that typical examples 

of systems encountered in sub-Riemannian geometry emerge from optimal control, or Vako- 
nomic' problems, which are derived from a different variational principle (minimization of the 

•i-H . 

^ ■ Carnot-Caratheodory distance) than the Euler-Lagrange equations of classical mechanical 

I systems with non-holonomic constraints (the Holder variational principle, cf. jH], and sec- 

tion III in this paper). The strategy is based on the introduction of an artificial Hamiltonian 
system with constraints that are compatible with the symplectic structure, constructed in a 
manner that it comprises the non-holonomic mechanical system as a dynamical subsystem 
on an invariant manifold. The main focus of the discussion in this paper aims at the geomet- 
rical and topological properties of the critical sets of both embedding and mechanical system, 
on the stability of equilibria, and an application of the given analysis to a computational 
problem. 

There exists a multitude of different approaches to the description and analysis of non- 
holonomic systems in classical mechanics, stemming from various subareas of application. 

The geometrical approach given here has been strongly influenced by j2Z] and ^1 
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A construction for the Lagrangian case, which is closely related to what will be presented 
in section III, has been given in ^31- A different approach in the Hamiltonian picture is 
dealt with in [35j. A geometrical theory of non-holonomic systems with a strong influence 
of network theory has been developed in [HHI- The geometrical structure of non-holonomic 
systems with symmetries and the associated reduction theory, as well as aspects of their 
stability theory has been at the focus in the important works |i7| |2Sl I2S EHI; and other 
papers by the same authors. 

This paper is structured as follows. In section I, we introduce a class of Hamiltonian 
systems with non-integrable constraints. Given a symplectic manifold (M, u) and a non- 
integrable, symplectic distribution V, we focus on the flow $j generated by the component 
of the Hamiltonian vector field Xh in V. In section II, we study the geometry and 
topology of the critical set € of the constrained Hamiltonian system. The main technical 
tool used for this purpose is a gradient-like flow (pt, whose critical set C is identical to that 
of Assuming that the Hamiltonian if : M — > M is a Morse function, it is proved that 
generically, C is a normal hyperbolic submanifold of M. Using Conley-Zehnder theory, we 
prove a topological formula for closed, compact that is closely related to the Morse-Bott 
inequalities. A second, alternative proof is given, based on the use of the Morse- Witten 
complex, to elucidate relations between the 'free' and the constrained system. In section 
III, we give a qualitative, partly non-rigorous discussion of the stability of the constrained 
Hamiltonian system, and conjecture a stability criterion for the critically stable case. A proof 
of the asserted criterion, which would involve methods of KAM and Nekhoroshev theory, is 
beyond the scope of the present work. We derive an expression for orbits in the vicinity of 
a critically stable equilibrium that is adapted to the flag of V, and point out relations to 
sub-Riemannian geometry. 

In section IV, we consider Hamiltonian mechanical systems with Pfaffian constraints. 
We show that for any such system, there exists an auxiliary constrained Hamiltonian system 
of the type introduced in section I. We study the global topology of the critical manifold 
of the constrained mechanical system, and again discuss the stability of equilibria. Finally, 
we propose a computational application, a method to numerically determine the generic 
connectivity components of the critical manifold. 
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1. A NON-INTEGRABLE GENERALIZATION OF DIRAC CONSTRAINTS 

Let (M, cu, H) be a Hamiltonian system, where M is a smooth, symplectic 2n- manifold 
with C°° symplectic structure u G A^(M), and where H G C°°{M) is the Hamiltonian 
function. For p = 1, . . . , 2n, let Ap{M) denote the C°°(M)-module of p-forms on M. The 
Hamiltonian vector field G r(TM) is determined by 

where i stands for interior multiplication. Given a smooth distribution W C TM, r{W) 
will denote the C°°(M)-module of smooth sections of W. The Hamiltonian flow is the 1- 
parameter group G Diff(M) generated by Xh, with t G R, and $o = id. Its orbits are 
solutions of 

(1) dMx) = XniMx)) 

for all X G M, and t G M. 

Let us first recall some standard facts about Dirac constraints that will be useful in 
the subsequent construction. Let, for f,g & C°°, 

(2) {f,g} = u;iXf,Xg) 

denote the smooth, non-degenerate Poisson structure on M induced hj uj. It is a derivative 
in both of its arguments, and satisfies the Jacobi identity {f,{g,h}} + (cyclic) = 0, thus 
(C°°(M), {-, ■}) is a Lie algebra. Then, ([TJ translates into 

(3) dJiMx)) = {HJ}{Mx)), 

for all / G C°°(M), and all x G M, t G M. 

Let j : M' M be an embedded, smooth, 2A;-dimensional symplectic submanifold of 
M, endowed with the puUback symplectic structure j*uj. The Dirac bracket corresponds to 
the induced Poisson bracket on M', 

{f,g}n = {fco)iXj,X-,) , 

defined for any pair of extensions f,g E C°°{M) oi f,g G C°°(M'). If M' is locally 

characterized as the locus of common zeros of some family of functions Gi G C°°(M), 

i = 1, . . . ,2{n — k), the following explicit construction of the Dirac bracket can be given. 

Since M' C M is symplectic, the (n — fc)^ functions locally given by Dij := {Gi,Gj} 

can be patched together to define a matrix-valued C°° function that is invertible everywhere 
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on M' . The explicit formula for the Dirac bracket is locally given by 



(4) {f,g}D = {frg] - D^^ {G.rg] , 



where D'^^ denotes the components of the inverse of [-Djj]. 

This construction can be put into the more general context of a symplectic distribution. 

Definition 1.1. A distribution V over the base manifold M is symplectic ifVx is a symplectic 
subspace of T^M with respect to uJx, for all x G M. Its symplectic complement is the 
distribution which is fibrewise uo-skew orthogonal to V . Furthermore, an embedding / C M 
M that is tangent to V is called V -horizontal. 

Clearly, V-^ is by itself symplectic, and smoothness of V and uj implies smoothness of 
V-^. Furthermore, the Whitney sum bundle V^SV^-*" is TM. Thus, let V denote an integrable, 
smooth, symplectic rank 2A;-distribution V over M. Clearly, any section X G T{TM) has a 
decomposition 

where X^^^^ G Viy^^^), so that ci;(X^,X^^) = 0. Furthermore, there exists an tu-skew 
orthogonal tensor vry : TM — > TM with 

Ker(7ry) = , 7ry(X) =X VX G V{y) , 

which satisfies 

(5) u{MX)X) = ^{X,MY)) 

for all X, y G T{TM). It will be referred to as the c<j-skew orthogonal projection tensor 
associated to V . Let Yi, . . . , Y2k denote a local spanning family of vector fields for V . Then, 
V being symplectic is equivalent to the matrix [Cij\ := [c<j(Ki, Y^)] being invertible. 

Lemma 1.1. Let [C^'] denote the inverse of [Cij], and let 6j := iy^oo G A^(TM). Then, 
locally, TTv = C^^i (8> Oj. 

Proof. The fact that C^-'Fj is a projector, and that (Ej) holds, follows from Oi{Yj) = Cij, 
and CijC^^ = 5-. Its rank is clearly 2k, and it is straightforward to see that its kernel is 
given by T{V^). D 



1.1. Non-Integrable Constraints. The quadruple {M,uj,H,V) naturally defines a dy- 
namical system whose orbits are all ^-horizontal. Its flow is simply the 1-parameter group 
of diffeomorphisms $i generated by := HviXfj) € r(^), with 

(6) dMx) = X^i^tix)) 

for every x G M. In a local Darboux chart, where uj is represented by 

P) ^ - ( X 

and where x{t) stands for the vector of coordinate components of $t(a), is given by 

(8) dtx{t) = {P, J d,H) (x(t)) = {J Pi d,H) (x(t)) . 

P denotes the matrix of vry, and P^ is its transpose. 

If the condition of integrability imposed on V is dropped, this dynamical system will 
allow for the description of non-holonomic mechanics. If V is integrable, M is foliated into 
2/i;-dimensional symplectic integral manifolds of V. On every leaf j : M' M, the induced 
dynamical system corresponds to the pullback Hamiltonian system {M, j*uj, H o j). In this 
sense, ( Ej) generalizes the Dirac constraints. 

A new class of dynamical systems is obtained by discarding the requirement of in- 
tegrability on V. Let [■,■] denote the Lie bracket. We recall that the distribution V is 
non-integrable if there exists a flltration 

(9) K) C C ^ C • ■ ■ C Vr, 

inductively deflned by Vq = V, and Vi = -|- [Vq, where Vi ^ Vq. The sequence {V^}^ 
is called the flag of V . If the flbre ranks of all Vi are base point independent, V is called 
equiregular. The smallest number deg(V) at which the flag of V stabilizes, that is, for which 
K = Kiog(v')) Vs > deg(l^), is called the degree of non-holonomy of V . If Vdeg(y) = TM, one 
says that V satisfles Chow's condition, or that it is totally non-holonomic. 

Proposition 1.1. (Frobenius condition) V is integrable if and only if locally, 

(10) := {ttvWv)- {dri-rrv)': - 5s(vry)9 = 
everywhere on M. 

Proof. V is integrable if and only if vfy {[ttv{X),7!'y{Y)]) = for all sections X,Y of TM, 
which is equivalent to Vi = V. The asserted formula is the local coordinate representation 
of this condition. □ 



1.2. An Auxiliary Almost Kahler Structure. For the analysis in subsequent sections, 
it will be useful to introduce an almost Kahler structure on M that is adapted to V. To 
this end, let us briefly recall some basic definitions. Let g denote a Riemannian metric on 
M. An almost complex structure J is a smooth bundle isomorphism J : TM — > TM with 

= —1. Together with g, it defines a two form satisfying 



for all sections X, Y e r{TM). g is hermitean if g{JX, JY) — g{X,Y), and Kahler if ujg^j 
is closed. The triple {g, J, ojg^j) is called compatible. Every symplectic manifold admits an 
almost complex structure J, and a Kahler metric g, such that {g, J,uj) is compatible. 

Proposition 1.2. For any symplectic manifold (M, a;), together with a symplectic distribu- 
tion V, there exists a compatible triple {g, J,u!), such that ny is symmetric with respect to g, 
and TTvJX = JttvX for all X e T{TM). 

Proof. We pick a smooth Riemannian metric g on M, relative to which vry is symmetric, for 
instance by choosing an arbitrary Riemannian metric g' on M, and defining g{X,Y) :— 
g^TTyXjUvY) + g'{7ivX,nvY), where vfy = 1 — Try. We consider the non-degenerate, 
smooth bundle map K defined by u!{X,Y) — g{KX,Y), which is skew symmetric with 
respect to g, that is, its ^-adjoint is K* — —K. K*K — —K"^ is smooth, positive def- 
inite, non-degenerate and ^-symmetric, hence there is a unique smooth, positive definite, 
^-symmetric bundle map A defined by — —K^, which commutes with K. Conse- 
quently, the bundle map J = KA~^ satisfies — —1, and defines an almost complex 
structure. Since A is positive definite and ^-symmetric, g{X,Y) := g{AX,Y) is a Rie- 
mannian metric with uj{X,Y) = g{JX,Y). Moreover, this metric is hermitean, since 
g{JX,JY) = g{KX,A~^KY) = -g{X,K^A-^Y) = g{X,AY) = g{X,Y). In fact, since 
LU is closed, g is Kahler. 

To show that vry is ^(-symmetric, we note that Try commutes with K, since g{KTrvX, Y) — 
u;(7rvX,Y) = u;{X,7rvY) = g{KX,7rvY) = g{7rvKX,Y) for all X,Y e r{TM), using that 
Try is symmetric with respect to g. Hence, Try commutes with = —K^, and it straight- 
forwardly follows from the ^-orthogonality of ^4, Try, and the positivity of A that Try and A 
commute. Hence, Try is g^-symmetric, and it is also clear that Try commutes with J = KA~^. 
Thus, J in particular restricts to a bundle map J :V ^V. □ 




ujg,j{X,Y) = g{JX,Y) 
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1.3. Further Properties. Some key properties of Hamiltonian systems concerning sym- 
metries, Poisson brackets, energy conservation, and, to some degree, symplecticness, can be 
generalized to the constrained system. 

1.3.1. Symmetries. Let us assume that the Hamiltonian system (M, u, H) admits a symplec- 
tic G-action {G some Lie group) \E' : G — Diff (M), such that \l/^ci; = uj and H = H for 
all h E G. Then, we will say that the constrained system (M, uj,H,V) admits a G-symmetry 
if ^j^^V = V holds for all h e G. 

1.3.2. Generalized Dirac bracket. The smooth, M-bilinear, antisymmetric pairing on G°°(M) 
associated to (M, cu, V) given by 

(12) {f,g}v := u;{MXf),MXg)) 

is a straightforward generalization of the Dirac and Poisson brackets. Along orbits of 
one has 

for all X G M, in analogy to ( Ej) • However, the bracket ( [T2)) does not satisfy the Jacobi 
identity if V is non-integrable, but it satisfies a Jacobi identity on every (symplectic) integral 
manifold if V is integrable. 

1.3.3. Energy conservation. This key conservation law also holds for the constrained system. 

Proposition 1.3. The energy H is an integral of motion of the dynamical system 

Proof. This follows from the antisymmetry of the generalized Dirac bracket, which implies 
that dtH = {H, H}v = 0. □ 

1.3.4. Symplecticness. The flow $t is not symplectic, but the following holds. Let us consider 

= - $4 ( {TTv)i diH dx^ ) = ( di{7Tv)i diH dx^ A dx'' ) 

= -\^*t{{di{^v)k - dki7rv)i)mdx' A dx'^) . 

Hence, the restriction of dt^^u to X, F G T{V) is given by 

d,^MX,Y) = -hl^AlAHX^Y^) , 

where A* ^ is defined in lemma 11.11 Thus, the right hand side vanishes identically if and only 
if V is integrable. In the latter case, the restriction of toVxV equals its value for t = 0, 

7 



given by u;(7ry(-), 7Ty{-)). On every integral manifold j : M' ^ M ofV, is symplectic with 
respect to the pullback symplectic structure j*uj. 

2. THE GEOMETRY AND TOPOLOGY OF THE CRITICAL MANIFOLD 

In this main section, we address geometrical and topological properties of the critical 
set <t of the constrained Hamiltonian system (M, uj,H,V). The main result of the subsequent 
analysis is, for M compact and without boundary, the topological formula ( IT5|) that inter- 
relates the Poincare polynomials of M and €gen in a manner closely akin to the Morse-Bott 
inequalities. This result implies that the topology of M necessitates the existence of certain 
connectivity components of €gen of a prescribed index. The analysis is structured as follows. 

In section 12. H we prove that € is, in the sense of Sard's theorem, generically a smooth 
2{n — fc)-dimensional submanifold €.gen C M. For the special case in which V is integrable, 
it is shown that the intersection of any integral manifold of V with C^en is a discrete set, 
in agreement with the usual understanding that critical points in Hamiltonian systems - on 
every leaf of the foliation in the integrable case - are typically isolated. 

In section 12.21 we introduce the main tool for the analysis of (t, an auxiliary gradient- 
like flow (pt G Diff (M) generated by the vector field iTyVgH, where g is the Kahler metric of 
the compatible quadruple introduced after proposition 11.21 From this point on, we assume 
that if : M — > M is a Morse function. Let j : €.gen ^ M denote the embedding. We show 
that <tgen is normal hyperbolic with respect to (pt, and that the critical points of j*H on 
Cgen are precisely those of H on M. The latter is quintessential for our discussion of the 
topology of (tgen via comparison of the Morse- Witten complexes of {Cgen,j*H) and of (M, H) 
in section 1231 

In section 17!^ we prove ( ITHj) by an application of Conley-Zehnder theory, ^^I; to the 
auxiliary gradient-like system. The assumptions on € are slightly less strict than genericity. 
In particular, assuming that €. \ (tgen is a disjoint union of manifolds, we show that every 
connectivity component of C \ (tgen is contained in a H-level surface, and that €. \ €gen can 
be deformed away by an infinitesimal perturbation of the vector field. 

In section we assume that € = Cgen, and give a second proof based on the compari- 
son of the Morse- Witten complexes of (M, H) and {^gen, -f^kjen)- O^r construction only uses 
the theory for non-degenerate Morse functions, not for Morse-Bott functions. The interest 
in this discussion is to elucidate the relationship between critical points of the 'free' system 
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{M,H), and of the critical manifold (Egen of the constrained system {M, H,V). The special 
case of mechanical systems (where M is noncompact) will be analyzed in a later section. 

2.1. Generic Properties of the Critical Set. Let us to begin with recall some basic 
definitions. Critical points of H are given by zeros of dH, and a corresponding value of 
H is called a critical value. A critical level surface T^e corresponds to a critical value E of 
H, whereas a regular level surface T^e contains no critical points of H (the corresponding 
value of E is then called regular). The critical set of the constrained Hamiltonian system 
(M, u!,H,V) is given by 

€ = {xeM\ X^{x) = 0} C M . 
The following theorem holds independently of the fact whether V is integrable or not. 

Theorem 2.1. In the generic case, the critical set is apiecewise smooth, 2 {n—k)- dimensional 
submanifold of M. 

Proof. Let {Yi}f^i denote a smooth, local family of spanning vector fields for V over an open 
neighborhood U C M. Since V is symplectic, the fact that Xj^ is a section of V implies 
that uj{Yi,Xj^) cannot be identically zero for all i and everywhere in U. Due to the a;-skew 
orthogonality of vry, and TfyYi = Yi, 

u;{Y„X^) = uiMY^^Xn) = uj{Y„Xh) = Y,{H) . 

Thus, with F := (Fi (//),... , Y2k{H)) G C^{U, M^^), it is clear that €nU = F-\0). Since F 
is smooth, Sard's theorem implies that regular values, having smooth, 2{n — /i;)-dimensional 
submanifolds of U as preimages, are dense in FjJJ) [28|. □ 

For future technical convenience, we pick a local spanning family {Yi G T{y)\fl^ for 
V that satisfies 

with J := ^ ^ "o^ ^ ■ "^^^^ choice is always possible. 

Furthermore, introducing the associated family of 1-forms {Oi} by Oi{-) := c<j(Fj, ■), 
theorem 11.11 implies that 

Try = X^Yi ® , 

where J*-^ are the components of = —J. Expanding X^ with respect to the basis {Yi} 
gives 

(13) X^ = MX^) = - y^iH) Y, , 
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where one uses the relationship 6j{Xj^) = Yj{H) obtained in the proof of theorem 12.11 
Then, the following proposition holds, which is in the subsequent discussion interpreted as 
the property of normal hyperbolicity of £ with respect to a certain gradient-like flow if the 
genericity assumption is satisfied. 

Proposition 2.1. Under the genericity assumption of theorem \2.1l the 2k x 2k-matrix given 
by [Yk(Yi{H)){a)] is invertible for all a G and every local spanning family {Yi G r(y)} of 
V. 

Proof. Let us pick a local basis {^i}?^ for V, and {Zj}'f'"'~''^ for V-^, which together span 
TM. Let a G €.gen, and assume the generic situation of theorem 12.11 Because €.gen is defined 
as the set of zeros of the vector field ( IT^ , the kernel of the linear map 



dFJ-)f'Y 



where Fi := Yi{H), is precisely Ta^geni and has a dimension 2{n — k). 

In the basis {Yi\a, . . . , Y2k\a, Zi\a, ■ ■ ■ , ^2(n-fc)|a}, its matrix is given by 

A = [Av Av±] , 

where Ay := [Yi{Fj)J^''Yk\a], and Av± := [Zi{Fj)J^''Yk\a]- Bringing A into upper triangular 
form. Ay is likewise transformed into upper triangular form. Because the rank of A is 2k, 
and Ay is a 2A; x 2A;-matrix, its upper triangular form has 2k nonzero diagonal elements. 
Consequently, Ay is invertible, and due to the invertibility of J, one arrives at the assertion. 

□ 

Corollary 2.1. Let {Yi}f^i denote a local spanning family for V, and let {Xi}f^-^^ be a local 
spanning family (of sections) of N(tgen, interpreted as a vector bundle over <tgen that is 
embedded in U^^n^^^T^M . Then, the matrix [g{Yi, Xj){x)]fj^i is invertible for every x G Cgen- 

Corollary 2.2. Let (Egen satisfy the genericity assumption of theorem \2.1\ IfV is integrable, 
the intersection of any integral manifold of V with (tgen is a discrete set. 

Proof. The previous proposition implies that generically, integral manifolds of V intersect 

<tgen transversely. Their dimensions are mutually complementary, hence the intersection set 

is zero-dimensional. □ 
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2.2. Normal Hyperbolicity and an Auxiliary Gradient-Like System. In this section, 
we introduce tlie main tool necessary for the analysis of the topology of given by an 
auxiliary gradient-like flow on M whose same critical set is also C Furthermore, we show 
that generically, €. is normal hyperbolic with respect to the latter. 

2.2.1. A Generalized Hessian. To begin with, we define a generalized Hessian for C The 
usual coordinate-free definition of the Hessian of H is VdH, evaluated at the critical points 
of H, where V is the Levi-Civita connection of the Kahler metric g. Let Hy denote the dual 
projection tensor associated to vry, which acts on sections of the cotangent bundle T*M, so 
that for any 1-form 9, {nyO, X) = {9, TXyX). The generalization of the Hessian in our context 
is the tensor V(7ry(iif), which acts as a bilinear form on T{TM) x T{TM) by way of 

V(4di/)(X,F) := iyx{'r:UH).Y) 

= {{{7ry)iH,),,-r„{rcv)iH,)X^Y\ 

where T^- are the Christoffel symbols. Evaluating this quantity on <t, the second term in the 
bracket on the lower line is zero. The non-vanishing term is determined by the matrix 

(14) [Krs] := [ii7rv)iH,),s]. 

One straightforwardly verifies that {iTvYiKjk = Kik is satisfied everywhere on (t, hence 
rank{K} < ranklvry} = 2k =rank{V^}. Clearly, the corank of K\a equals the dimension of 
the connectivity component of (t containing a. 

2.2.2. Definition of the Gradient-Like System. A flow is gradient-like if there exists a function 
/ : M ^ R that decreases strictly along all of its non-constant orbits. The flow of the 
constrained Hamiltonian system is not gradient- like, and hence turns out to be of limited 
use for the study of the global topology of because invariant sets of $t do in general not 
only contain fixed points, but also periodic orbits. 

Instead, we introduce the auxihary dynamical system 

(15) dMt) = -invVgH){^it)) , 

where 7 : / C M ^ M, which turns out to be an extremely powerful tool for our purpose. 
Let us denote its flow by (pt G Diff(M). The orbits of ( I15|) are clearly ^-horizontal, and 
both $^ and (pt exhibit the same critical set C 

Proposition 2.2. The flow (pt is gradient-like. 
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Proof. Since 

= -9 {T^V^gH , TlvVgH) (7(t)) < 0, 

it follows that H decreases strictly along the non-constant orbits of (pf We have here used 
the fact that {g, J,uj,7Cv) is a compatible quadruple. □ 

{g, J,uj,7rv) has been constructed for this precise reason. It is immediately clear that 
(pt generates no periodic trajectories, hence €. comprises all invariant sets of (pf 

2.2.3. Morse Functions and Non-Degenerate Critical Manifolds. Let us next recall some 
standard definitions from Morse- and Morse-Bott theory that will be needed in the subse- 
quent discussion. The dimensions of the zero and negative eigenspaces of the Hessian of / 
at a critical point a are called the nullity and the index of the critical point a. If all critical 
points of / : M ^ M have a zero nullity, / is called a Morse function, and the index is 
then called the Morse index of a. If the critical points of / are not isolated, but elements 
of critical manifolds that are non-degenerate in the sense of Bott, H is called a Morse-Bott 
function [H]. Throughout this section, we will assume that if is a Morse function. 

Furthermore, we recall some standard definitions related to normal hyperbolicity, ap- 
plied to the case of C A connectivity component is locally normal hyperbolic at the 
point a G C with respect to (pt if it is a manifold at a, and if the restriction of Aa to the 
normal space Na€. is non-degenerate. A connectivity component is called non-degenerate 
if it is a manifold that is everywhere normal hyperbolic with respect to (pt- The index of a 
non-degenerate connectivity component is the number of eigenvalues of the constrained 
Hessian Aa on €i that are contained in the negative half plane. 

Proposition 2.3. IfC is generic in the sense of Sard's theorem, it is normal hyperbolic with 
respect to the gradient-like flow (pt. 

Proof. This follows straightforwardly from proposition 12. II □ 

Let (ti, i = 1, ..,1 denote the connectivity components of C = UCj, and let ji : fti ^ M 
denote the embedding of the i-th components. 

Proposition 2.4. Assume that <t satisfies the genericity assumption in the sense of Sard's 
theorem, and that H : M is a Morse function. Then, Hi := H o : (ti ^ W is a Morse 
function, and x G Cj is a critical point of Hi if and only if it is a critical point of H . 

12 



Proof. It is trivially clear that every critical point of i7 is a critical point of Hi. To prove 
the opposite direction, suppose that a is an extremum of H\(ti. Then, VgH\a G NaC, but 
also, by definition of (t, PaVgH\a = 0. By corollarv 12.11 this can only be true if VgHa = 0. 
The Hessian of the restriction of H at any critical point of Hi is nondegenerate, thus Hi is 
a Morse function on Cj. □ 

Corollary 2.3. The critical points of H\(r^^^ : (tgen o,re precisely the critical points 

of H : M ^ M. If (ti is a non- generic connectivity component that is a normal hyperbolic 
submanifold of M, <ti C T,H{<ti)- 

Proof. The first assertion follows trivially from the previous proposition. Assuming that €i 
is a non-generic connectivity component of <t that is a manifold and normal hyperbolic, the 
previous proposition implies that there are no extrema of ifjc^. Thus, is a submanifold of 
the level surface □ 

2.3. Approach via Conley-Zehnder Theory. The goal in this and the next section is to 
derive the relationship ( fTH|) between the Poincare polynomials of and M. We first ap- 
proach this problem by use of Conley-Zehnder theory, [15_^ , and under slightly less restrictive 
assumptions than genericity. 

Let us for convenience first recall some of the key elements from Conley-Zehnder theory, 
[T3] . Let be any compact component of €. An index pair associated to is a pair of 
compact sets {Ni, Ni) that possesses the following properties. The interior of Ni contains Cj, 
and moreover, is the maximal invariant set under (pt in the interior of Ni. Ni is a compact 
subset of Ni that has empty intersection with Cj, and the trajectories of all points in Ni that 
leave Ni at some time under the gradient-like flow (pt intersect Ni. Ni is called the exit set 
oiNi. 

The homotopy type of the pointed space Ni/Ni only depends on C,, by a result proven 
in so that the relative cohomology H*{Ni, Ni) (with coefficients appropriately chosen) 
is independent of the particular choice of index pairs (the space Ni/Ni is obtained from 
collapsing the subspace Ni of Ni to a point). The equivalence class [Ni/Ni] of pointed 
topological spaces under homotopy is called the Conley index of C,. 

Let I denote a compact invariant invariant set under (pf A Morse decomposition of 
J is a finite, disjoint family of compact, invariant subsets {Mi, . . . ,M„} that satisfies the 
following requirement on the ordering. For every x E I \ UjMj, there exists a pair of indices 
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i < j, such that hmi^_oo 4>t{x) C Mj, and hm^^oo 4't{x) C Mj. Such an ordering, if it exists, 
is called admissible, and the Mj are called Morse sets of /. 

For every compact invariant set / admitting an admissibly ordered Morse decomposi- 
tion, there exists an increasing sequence of compact sets Ni with A^o C A^i C ■ ■ ■ C Nm, such 
that {Ni, Ni^i) is an index pair for Mj, and {Nm, Nq) is an index pair for /, [TH] . 

Consider compact manifolds A D B D C. It is a standard fact that the exact sequence 
of relative cohomologies 

■ ■ ■ H\A, B) H\A, C) H\B, C) ^ H''+\A, B) ^ . . . 

implies that, with rj_p denoting the rank of H^{Ni, Ni-i), 

i,p p 

for the indicated Poincare polynomials (cf. for instance [22] )• bj is the j-th Betti number 
of the index pair {Nm, Nq) of I, and Q(A) is a polynomial in A with non-negative integer 
coefficients. Due to the positivity of the coefficients of Q(A), it is clear that ^j^-j p > bp. 

If M is compact and closed, and if € is non-degenerate, the following holds. The 
invariant set / can be chosen to be equal to M. We let = M and A^o = denote the 
top and bottom elements of the sequence defined above, and order the connected elements 
of C according to the descending values of the maximum of H attained on each Cj. Then, €. 
furnishes a Morse decomposition for M. The homology groups of M are isomorphic to the 
relative homology groups of the index pair {Nm,NQ). Hence the numbers bp are the Betti 
numbers of M. 

Proposition 2.5. Let €i C ^gen be a generic connectivity component, compact and without 
boundary, and let {Ni, N^) denote any associated index pair. Then, 

(16) m+^^{N„N,) ^ H^li) , 

where fii is the index of (ti, and q = 0, . . . , dim{(Li). 

Proof. We consider, for eo > small, a compact tubular eo-neighborhood U of €i (of dimen- 
sion 2n), and let 

W[r{(Li) := {W-{€i)U(Li)nU 

denote the intersection of the center unstable manifold of (Li with U. W~{€i) denotes the 
unstable manifold of Pick some small, positive e < eo, and let Ue be the compact tubular 
e-neighborhood of Vr^"(£j) in U. 
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Letting e continuously go to zero, we obtain a homotopy equivalence of tubular neigh- 
borhoods, for which W[^^{<Ei) is a deformation retract. Let 

ur' := du, n MUe) 

denote the intersection of dUe with all orbits of the gradient-like flow that contain points in 
U^. Then, (f/^, [/°"*) is an index pair for Cj, and by letting e continuously go to zero, f/°"* is 
homotopically retracted to dW[f^{<ti). 
Thus, by homotopy invariance. 

Since €i is normal hyperbolic with respect to the gradient-like flow, has a constant 

dimension rii + everywhere, where Ui = dim^j. Therefore, by Lefschetz duality, jTHj, 

where /ij = the index of Cj. Since is a deformation retract of the interior of 

the respective cohomology groups are isomorphic. 

Due to dim{^i) = rii, we have by Poincare duality 

so that with q := rii — p, 

(17) H'^+^'*{U„ = H'^i^i^i), 

which proves the claim. □ 

From ( II 7j) . we deduce that r^ p = dimi7*'^~^'(Cj) (recalling that /ij is the index of Cj), 
hence = hp_^-{^i). Assuming that the number of connected components of ^ is finite, 
one thus obtains 

(18) J] A^+^»6p(c:,) = 5^ A%(M) + (1 + A)Q(A), 

i,p p 

which in particular implies J2iK-fj.ii^i) — ^p(^)- Setting A = —1, 

J2{-^r'%i^^) = $^(-i)^'x(c:.) = x(M), 

p i 

where x denotes the Euler characteristic. 
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Remark. In the case of mechanical systems, the phase space of the relevant constrained 
Hamiltonian system is non-compact, and the critical manifold is generally unbounded. There- 
fore, the arguments used here do not apply. However, since in that case, M and £ are vector 
bundles, we are nevertheless able to prove results that are fully analogous to (CHI)- 

We can prove a slightly more general result by relaxing the assumption of genericity. 

Proposition 2.6. Assume that £\ CCgen is a disjoint union of -manifolds. Then, 

(19) ^^'P = Y^b.X^ + {1 + X) Q(A) . 

i,p p 

hi p are the p-th Betti numbers of the connectivity components of ^gen, bp are the Betti 
numbers of M , and Q is a polynomial with non-negative integer coefficients. 

Proof. We show that Xjj can be infinitesimally perturbed such that C \ C^en is removed. 
Consider, for e > 0, the compact tubular neighborhoods 

(20) U,{(li) = {x G M I distg{x, ^i) < e} 

of connectivity components C £ \ C^en, where distg denotes the Riemannian distance 
function induced by g. We introduce a vector field X^, given by nvVgH in M \ Ue{Ci), and 
in the interior of every U^{^i) with C C \ Cgen, by 

(21) X,\, = 7rvVgHU + eh{x)VgH\^ . 

Here, h G C^{Ue{Ci), [0, 1]), obeying h\it. = 1 and h\du^(^Ci,) = is strictly monotonic along 
the the flow lines generated by TTy'W gH. h exists because €. \ Cgen is a disjoint union of 
C^-manifolds. 

For all C C \ Cgen, Vg-ff is strictly non-zero in Ue(Ci), as shown above. We have 

giX,,VgH) = i\\nvVgH\\l)ix) + ehix)i\\VgH\\l)ix), 

where we have used the (^-symmetry of vry, and = g{X, X). The first term on the right 

hand side is non-zero on the boundary of Ue{Ci), while the second term vanishes. Moreover, 
the second term is non-zero everywhere in the interior of Ue{€i). Therefore, X^ vanishes 
nowhere in f/^(Cj). Hence, X^ is a deformation of vryVgif, with critical set Cgen- Notably, 
€.gen cannot be removed in this manner, since it contains critical points of H. 

Since g{X^, VgH) is strictly positive in every Ue{^i), X^ generates a gradient-like flow. 
Since \\X^ — TTvVgH\\g < 0(e) everywhere on M, one can pick X^ arbitrarily close to TCvVgH 
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in the L°°-norm on r(TM) induced by || ■ \\g. Carrying out the Conley-Zehnder construction 
with respect to the flow generated by yields ( IT^ . This result does not require the 
assumption of normal hyperbolicity on (t. □ 



2.4. Approach via the Morse- Witten Complex. We will next provide a different deriva- 
tion of ( I19j) . based on the construction of the Morse- Witten differential complex. The mo- 
tivation is to clarify the orbit structure of the gradient-like system, and to devise an explicit 
construction that relates the Morse- Witten complexes of the free and constrained system to 
one another. This in particular only involves the corresponding theory for non-degenerate 
Morse functions. 

Let us to begin with briefly recall the basic framework of this construction. Let M 
be a compact, closed, orientable and smooth n-manifold, and let / : M — > M be a Morse 
function. Let denote the free Z-module generated by the critical points of / with a Morse 
index p. The set C = (BpC^ is the free Z-module generated by the critical points of /, and 
graded by their Morse indices. There exists a natural coboundary operator S : ^ C^~^^, 
with 6 o S = 0, whose construction we recall next, cf. |3l 13 IT7| 

Introducing an auxiliary Riemannian structure on M, we let W~ and denote the 
unstable and stable manifold of the critical point a of f under the gradient flow, respectively, 
and assign an arbitrary orientation to every W~. The orientation of M, together with the 
orientation of W~ at every critical point a induces an orientation of W^. Morse functions, 
for which all W~ and intersect transversely, are dense in C°°(M). The dimension of W~ 
equals the Morse index fi{a) of a, and the dimension of the intersection M (a, a') := W~r\W^ 
is given by max{/i(a) — /i(a'),0}. For pairs of critical points a and a' with relative Morse 
index 1, say /i(a) = /i(a') -|- 1, M{a,a') is a finite collection of gradient lines that connect a 
with a'. 

The intersection of M(a, a') with any regular level surface Sc of / with /(a) < /(Sc) = 
c < f{a') is transverse, and consists of a finite collection of isolated points. Then, one picks 
the orientation of Sc, which, combined with the section Vgf of its normal bundle, shall agree 
with the orientation of M. The submanifolds W'^ := W~ f] and I^Jt,, := IV^t n Sc of 
Ec are smooth, compact and closed, with complementary dimensions in Ec, and orientations 
picked above. Hence, their intersection number, which is often in this context written as 
(a, 6a') := ti(Vra~c, is well-defined, [20]. The coboundary operator of the Morse- Witten 
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complex is defined as the Z-linear map 5 : ^ C^+^, defined by 

6a' = (a, 6a')a. 

Theorem 2.2. The cohomology of the differential complex {C,6) is isomorphic to the de 
Rham cohomology of M, ker5/im5 = H*{M,Z). 

The proof can for instance be found in jU [T7| 123 ES] ■ If («5 So.') 7^ for a pair a and a' 
of critical points with a relative Morse index 1, we will say that they are effectively connected 
(by gradient lines). 

As is well-known, the existence of the Morse- Witten complex implies the strong Morse 
inequalities in the following manner. Let := kerS fl denote the p-th cocyle group, 
C the p-th coboundary group, and := \ the p-th cohomology group under 
5. Thus, dimifP = 6p(M) by theorem O From 

dimCP = 6p(M) + dimS^ + dim5P+\ 

where dimC^ = Np (the number of critical points of / with Morse index p), follows 

n n n 

(22) XPNp = J2 + (1 + A) ^ A^^Mim^P 

p=0 p=0 p=l 

(both B^ and iJ^^+i ^^-g empty). The coefficients of the polynomial Q(A) = ^A^^^dimi?^ 
are evidently non-negative and integer. Clearly, dimi?^ is the number of critical points of 
Morse index p that are effectively connected to critical points of Morse index p — 1 via 
gradient lines of /. 

2.4.1. Comparing the Complexes for the Free and Constrained System. The goal 
of our discussion here is to devise an explicit construction that relates the Morse- Witten 
complex of the free system {M,H) to the one on the critical manifold (Cgen, -f^lcgen), by a 
deformation of the gradient-like flow (pf This will yield f IT^. 

Let denote the i-th connectivity component of C^en, and Ai := {aj^i, . . . ,aj,m} the 
critical points of H contained in €{. Furthermore, let fi{ai^r) be the associated Morse indices 
of if : M ^ M, and Hi := H\,r^ denote the restriction of H to By proposition 12.31 and 
corollarv l2.31 Hi : — > M is a Morse function, whose critical points are precisely the elements 
of Ai. The index fi{^i) of equals the number of negative eigenvalues of the Hessian of H 
at any aj^.,. G Ai whose eigenspace is normal to Cj. 
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The Morse index of aj ,. with respect to Hi is thus fi{ai^r) To define the Morse- 

Witten complex associated to C,, we introduce the free Z-module generated by the elements 
of Ai, graded by the Morse indices p of the critical points of Hi, 

Ci = ®pCf. 

To construct the coboundary operator 6i : Cf Cf^^, one uses the gradient fiow on 
generated by Hi, thus obtaining 

(23) keT6i/im6i = H*{(Ei,Z). 
Application of (|221) shows that for every G Cgen, 

(24) J^A^iV,,, = 5^A%((2:,) + (l + A)5^A^'Mimi?f, 

p p p 

where Bf is the p-th coboundary group of the Morse- Witten complex of C,, and Ni^p is the 
number of critical points of Hi on £j of Morse index p. 

Since every critical point of H lies on precisely one generic component €i, the number 
Ng of critical points of H with a Morse index q is given by 

^p = ^Ni;p-K'^)- 

i 

Thus, combining ( I24|) with f one obtains 

i,p;€i€<Cgen q 

+ (1 + A) ^ A^-i (diuiB^ - dim^r^^*^^) . 

q ti^tgen 

Hence, ( [T^ is equivalent to the statement that the polynomial on the last line, which is 
multiplied by (1 + A), has non- negative integer coefficients. 

By a homotopy argument, we will now prove that for all q, 

(25) dimE^ > Y dim^*'"''^'^^ 

holds, thus obtaining an alternative proof of f [TI?|) . The main motivation here is to give an 
explicit construction that geometrically elucidates this relation, noting that the left hand 
side is defined by the fiow of the 'free' gradient-like system corresponding to {M,uj,H), 
while the right hand side is defined by the constrained gradient-like system corresponding to 
{M,uj,H,V). We note that dimi?^^ denotes the number of critical points of H with a 
Morse index g in Cj, which are effectively connected to critical points of Morse index p + 1 in 
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via gradient lines of the Morse function Hi on di. Therefore, the sum on the right hand 
side of ( equals the number of those critical points of H with a Morse index q, which are 
effectively connected to critical points of Morse index g + 1 via gradient lines of the functions 
H o jj on all generic Cj. Here, ji : (ti —>■ M denotes the corresponding inclusion maps. 

2.4.2. Proof of ( \25]) . Our strategy consists of constructing a homotopy of vector fields fs, 
with s G [0, 1], whose zeros are hyperbolic and independent of s, which generate gradient-like 
flows. They interpolate between vi := VgH, and vq, which is a vector field that is tangent 
to €gen- For every s G [0, 1], we construct a coboundary operator via the one-dimensional 
integral curves of Vg that connect its zeros. These coboundary operators are independent of 
s, and act on the free Z-module C of the Morse- Witten complex associated to (M, H). f 
then follows from a simple dimensional argument. 

Lemma 2.1. There exists Vq G r(TM), which is gradient-like, and tangent to €gen- Further- 
more, the zeros of are hyperbolic, and identical to the critical points of H . The dimension 
of any unstable manifold of the flow generated by —Vq equals the Morse index of the critical 
point of H from which it emanates. 

Proof. We recall the vector field constructed in the proof of proposition 12121 and consider 
the compact tubular e-neighborhoods U^{(Egen), defined similarly as in ( I2(jp. Furthermore, 
let Q = (resp. Q = 1 — Q) he (7-orthogonal, smooth tensors on TU<^{dgen) of fixed 
rank 2(n — k) (resp. 2k), with Ker{Q(a)} = Na^gen (resp. Ker{(5(a)} = Ta^gen) for every 

a, G Cgieri- 

We define Vq as follows. In M \ U^{(tgen), it shall equal X^, and that for x in U^{Cgen), 
it shall be given by 

Vo{x) := {nv^gH){x) + h{x){QVgH){x), 

where h : U^i^gen) — > [0, 1] is a smooth function obeying /ijcg^^ = 1 and h\Qjj^(Cgen) = 0- 

particular, h is assumed to be strictly monotonic along all non-constant trajectories of the 

flow generated by nyVgH, and dh shall vanish on €gen- 

It can be easily verified that vq possesses all of the desired properties. It generates 

a gradient-like flow, since outside of Ue{dgen), g(VgH,vo) = g(V gH, X^) > 0, as has been 

shown in the proof of proposition 12.61 In the interior of [/^(CCgen), one finds g(VgH,Vo) = 

WhvV gHW^-l- hHQV gH\\g, due to the gf-orthogonality both of nv and Q. The first term on the 

right hand side vanishes everywhere on dgen, but at no other point in Ue{dgen)- The second 
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term equals HQV^-f^H^ on ^gen- Since evidently, QV gH\^^^^ is the gradient field of the Morse 
function H\ir^^^ : C^en ~^ ^ relative to the Riemannian metric on T^g^n induced by its 
zeros are precisely the critical points of H on Cgen? and it possesses no other zeros. Because 
g{vQ, V gH) > except at the critical points of H , it is clear that —vq generates a gradient- 
like flow ipo^ti so that H is strictly decreasing along all non-constant orbits. Furthermore, it 
is clear from the given construction that Vq is tangent to ^gen- 

To prove the remaining statements of the lemma, we note that the Jacobian matrix of 
Vq at a in a local chart is given by 

(26) Dvo{a) = {DlHf + (P, - Qa){DlH)K 

There is no dependence on h because dh\^^^^ = 0. Furthermore, {DIhY is defined as the 
matrix [g'^^Hjk\a] in the given chart, and Pa denotes the matrix of 7ry(a). Normal hyper- 
bolicity follows from the invertibility of DvQ{a), which is verified in the proof of lemma ITT) 
below. □ 

Lemma 2.2. Let Vs '■= sVgH + (1 — s)vq with s G [0, 1]. Then, the flow ips^t generated by 
—Vs is gradient-like for any s G [0, 1]. The zeros of Vg are hyperbolic fixed points of ipg^t, ond 
independent of s. Thus, the dimensions of the corresponding unstable manifolds equal the 
Morse indices of the critical points of H from which they emanate, for all s. 

Proof. We consider giV gH, Vg) = s|| Vgi^H^ + (1 — s)g{'VgH, vq). The first term on the right 
hand side is obviously everywhere positive except at the critical points of H, and the same 
has been proved previously for the second term. Thus, H decreases strictly along all non- 
constant orbits of ijjs,t, hence the latter is gradient-like. The Jacobian of Vs at a critical point 
of H is given by 

Dvsia) = (l2„ + (1 - s){Pa - Qa)){DlH)K 

Dvs{a) is invertible for all s G [0, 1], since (DlH)^ is invertible, and spec{Pa — Qa} C (—1, 1). 
To prove the latter, we first observe that specjPa — <5a} C [—1, 1] is trivial, because Pa and Qa 
both have a spectrum {0, 1}. {—1, 1} is not included, because otherwise, PaQa, respectively 
PaQai would not havc a full rank, in contradiction to corollarv 12.11 □ 

By smoothness of f^, it follows that ips^t is C°° in s. Thus, s smoothly parametrizes a 
homotopy of stable and unstable manifolds of ips^t emanating from the critical points of H. 
Since the fixed points of ips^t are independent of s, and the dimensions of the corresponding 
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unstable manifolds are equal to the Morse indices of the critical points of H, we consider, for 
every value of s G [0, 1], the free Z-module C = (BpC^ that is generated by the critical points 
of H, and graded by their Morse indices. For every s, we define a coboundary operator 
on C, using iljs,t as follows. Picking a pair of critical points of H with a relative Morse 
index 1, we consider the unstable manifold of a, and the stable manifold W^^, of a' 
associated to ips^f Since s parametrizes a homotopy of such manifolds, they naturally inherit 
an orientation from the one picked for s = 1 in the construction of the coboundary operator 
of the Morse- Witten complex for {M,H). 

Let Tie denote a regular energy surface for H{a) < E < H{a'). W^^ intersects Te 
transversely, because H is strictly decreasing along all non-constant orbits generated by —Vg. 
W~^nT,E and W/^/flE^; define two homotopies of oriented submanifolds of S^;. By homotopy 
invariance of their intersection number, the coboundary operators are independent of s, and 
thus identical to the (5-operator of the Morse- Witten complex given for s = 1. 

The stable and unstable manifolds of '?/'o.t are either confined to some Cj, or connect 
critical points lying on different €j's. Let us consider pairs of critical points of H with a 
relative Morse index 1 that lie on the same component G €.gen, and the corresponding 
stable and unstable manifolds of ipo,t which are contained in £j. Since vold is the projection 
of VgH\ir. to TCj, these stable and unstable manifolds are the same as those which were used 
to define the Morse- Witten complex on Hi). Using only stable and unstable manifolds of 
ipo^t contained in €gen, we construct an operator 6 acting on C in the same manner in which 
the coboundary operator was defined, thus obtaining 6 = (BiSi, where 6i is the coboundary 
operator of the Morse- Witten complex associated to the pair {^i,Hi). Let Pi : C ^ Ci 
stand for the projection of the free Z-module C generated by all critical points of H to 
the one generated by the critical points contained in €j. Eliminating all integral lines of 
—Vo that connect critical points on different connectivity components of €gen in the above 
construction, one sees that 6i = PiSPi, thus 6 = Pi6Pi. Hence, clearly, 

dim(im(5|cp) > dim(im5|cp) , 

which precisely corresponds to ( . This completes the proof. 

3. QUALITATIVE ASPECTS RELATED TO CRITICAL STABILITY 

So far, we have established that in the generic case, the connectivity components of 
^ = *^gen are embedded submanifolds of dimension 2{n — k) equal to the corank of V. 
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Furthermore, we have seen that the topology of the symplectic manifold M enforces the 
existence of connectivity components of C^en of certain prescribed indices with respect to the 
auxiliary gradient-like flow (pf 

In this section, we focus on the physical dynamics, characterized by the flow gener- 
ated by XYji of the constrained Hamiltonian system (M, cj, if, V^) in a tubular e- neighborhood 
of Cgen, and particularly on the issue of stability. Let g again denote the auxiliary Kahler 
metric introduced in section |2l with the induced Riemannian distance function given by 
dist^ (in contrast to the Carnot-Caratheodory distance function dc-c induced by g, which 
will also be considered). We recall that a point a G C^en is stable if there exists 5{e) > for 
every e > 0, so that for all t, dist^{^t{x)-, a) < t holds for all x with distR{x, a) < S{e). 

To elucidate the key differences between the local dynamics in the vicinity of (Lgen for 
the cases of integrable and non-integrable V, let us first describe the situation where V is 
integrable. As proved in corollary 123 M is foliated into 2/i;-dimensional symplectic subman- 
ifolds which intersect (tgen transversely. Thus, on every leaf A/", the equilibrium solutions are 
generically isolated points. Let the linear operator Qa correspond to the linearization of 
on TqM for some a E Af H ^gen, and restricted to the fibre Va = TaAf C T^M. Its spectrum, 
if it is not purely imaginary, conclusively characterizes the stability of a; we refer to this 
as the asymptotically (un) stable case. If the spectrum of Qa is purely imaginary, which we 
refer to as the critically stable case, is is well-known that if there exists a local Lyapunov 
function Lj^ : U{a) fl A/" ^ M for a, then a is stable. 

If V is non-holonomic, the situation is similar in the asymptotically (un)stable case, 
but drastically different in the critically stable situation. In the critically stable case, the 
presence of a local degenerate Lyapunov function for a single equilibrium a G Cgen is of 
limited use, since there is a whole submanifold (tgen H S^; (the H-level set for the energy E of 
the initial condition) of valid equilibria for a given energy E. One may relax this condition 
to the existence of the following class of functions. 

Definition 3.1. Let denote the component of the gradient normal to <tgen with 
respect to g at €gen- Let U{a) be a distn-small open neighborhood of a E ^gen- A local, 
degenerate, almost Lyapunov function for a is a class function L : U{a) —>■ M., which 
satisfies (V^L)(a') = for all a' G Cgen H U{a), and ||VgL||g > for all x G U{a) \ ^gen- 
Furthermore, {'VgdL)\a' is positive definite quadratic form on Na'^gen for alia' G U{a)r\^gen, 
and L{^t{xo)) < L{xo) for all xq G U{xo), and all t such that $t(xo) G U{a). 
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Notably, L defined here is not a local degenerate Lyapunov function, because fl U{a) 
is not a critical level set (we remark that this would be equivalent to L being a Morse-Bott 
function in U{a)), on which L is extremal. 

While the existence of L guarantees that the orbit $t(xo) remains within a tubular 
e-neighborhood of ^gen H U{a) for all t such that $f(xo) G f/(a), it does not imply stability 
of a G Cgen n U . There is an additional, necessary condition on the rational independence 
of the frequencies of the oscillatory linear problem that must be imposed. Otherwise, an 
inner resonance, connected to the appearance of small divisors, occurs, and $t(xo) G U{a) 
may evolve away from a, in a diffusive motion along the higher flag elements of V that are 
approximately tangent to C^en, while along V , which is transverse to C^en, the motion is 
bounded and oscillatory. 

From the analysis in section |21 it is clear that for every connectivity component C 
^gen of index ^{^i) = (with respect to (pt), the Hamiltonian if is a local degenerate, 
almost Lyapunov function for all of its points. The minimum a* of H\(r. on (ti is a local 
minimum of H, and hence stable (since H serves as a Lyapunov function for a*). Hence, in 
particular, if V is integrable, all points on are stable if /i(Ci) = 0. We also note that on 
the connectivity components <tj with index fi{*tj) > 0, H is never a local degenerate, almost 
Lyapunov function. 

The main focus in this section will be to discuss issues of this type. However, an 
essential part of sections l3 . L 21 and 13 . 2 . 21 will be in mathematically non-rigorous terms, since 
a rigorous treatment of the matters addressed there would fall into the domain of KAM and 
Nekhoroshev theory, and is beyond the scope of the present work. 

A concrete aim in this discussion is to arrive at stability criteria for equilibria of the 
constrained Hamiltonian system {M,uj, H,V). From an instructive, despite elementary, ap- 
plication of averaging theory, we conjecture a condition for the critically stable case that 
involves an incommensurability condition imposed on the frequencies of the linearized prob- 
lem, as remarked above. In order to elucidate its geometric content, we study the dynamics 
in the vicinity of a critically stable equilibrium in a geometrically invariant form that is 
adapted to the flag of V. Invoking a perturbation expansion based on this description, we 
argue that the incommensurability condition, which might merely correspond to an artefact 
of the averaging method, cannot be omitted. A rigorous proof of the conjectured stability 
criterion is left for future work. 
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3.1. Stability Criteria. Let a G C^en? and pick some small neighborhood U{a) C M 
together with an associated Darboux chart, with its origin at a. The equations of mo- 
tion are given by dtXt = P{xt)JH^x{xt) = X]^(a;t), where the coordinates are given by 
X = (x^, . . . , x", Xn+i, . • . , X2n), and JT" is the symplectic standard matrix. Furthermore, 
abbreviates d^H, and P is the 2n x 2r;,-matrix representing the tensor vry. tu-skew orthogo- 
nality of TTv translates into P{x)J'X{x) = J'P'^{x)X{x) for all vector fields X. 

Proposition 3.1. There exists a chart in which the equations of motion have the form 

(27) dtivu zt) = {noyt + Y{zt, yt) , Z{zt, yt)) G m"' x M^C-a^) . 

In particular, Qq corresponds to the restriction of DX}^{0) to Vq, and \Y{y,z)\, \Z{y,z)\ = 
0{\y\\z\) + 0{\y\^). 

Proof. In a sufficiently small vicinity U C M^" of the origin (corresponding to a), one infers 
from corollary 12.11 that TaCgen © Vq = M^", for a G f/ fl ^gen- Accordingly, we choose 
local coordinates z G f/'(0) C R2(""'=) on Cg^n, and y G Vq, noting that the decomposition 
X = a{z)+y for any x E U G M^" is unique, where a : R^"~'^ "—^Uis the (smooth) embedding. 
Let y denote the coordinates of y with respect to some family of basis vectors for Vq. Then, 
( I77j) evidently follows from Taylor expansion. □ 

3.1.1. Asymptotic (In) stability. If spec{i7o} U = 0, there exists, by the center manifold 
theorem, a coordinate transformation {y,z) —>■ {y,z), such that ( ITFj) becomes 

(28) dt{yt,Zt) = {noyt + Y{yt,Zt), O) 

llOj, where Y{0,z) = for all z. Thus, a G C^en is asymptotically unstable if there are 
eigenvalues with a positive real part, and asymptotically stable if all eigenvalues have a 
negative real part. If V is integrable, asymptotic stabihty is impossible, because the dynamics 
is Hamiltonian on every integral manifold. However, if V is non-integrable, there is, to the 
author's knowledge, no obstruction to the existence of asymptotically stable equilibria, since 
the flow map is not symplectic. 

3.1.2. An Elementary Application of Averaging Theory. In the case of critical stability, one 
has specjfio} = {i^^i, ■ ■ ■ , i^^2k}, with cuj G M \ {0} for i = 1, . . . , 2k. Let us for the context of 
an averaging analysis assume that the vector fields on the r.h.s. of (|2ZI) are real analytic with 
respect to {y, z). We apply a complex linear coordinate transformation that diagonalizes Qq, 
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and denote the complexified, new coordinates and vector fields again by {y, z), and Y{y, z), 
Z{y,z), respectively, by which we find 

(29) dtiyt, zt) = (diag(z^)yi + Y{yt, Zt) , Z{yt, z^)) e C^^^ x C2("-'=) , 

where uj := (cji, . . . ,uj2k)- Complexifying (EZj), the continuation of (tgen into C^" is defined 
as the common zeros of 1^(0, z) and Z{0, z) for z G C^("-'=). 

We next introduce polar coordinates (/, 0) G M?^ x [0, 2txY^ and (J, d) G M^^""^) x 
[0, 27r)^("~'^) in terms of y"^ =: e^'I'^F and =: e*^^ J*, with r = 1, . . . , 2fc and s = 1, . . . , 2n — 
2A;. In particular, / G M^^', J G R^"-^'^, g [0,27r]2'= = T^'^ (the 2A:-dimensional torus), 
and e G [0, 27r]2"-2'= = T2n-2fc_ brevity, let e'^v := {e'^H\ . . . .e^'t'^^v'^^) and e^^w : = 
{e'^^w^, . . . ,e*''2(n-fc)^y2{n-fc))^ ^ g ^2fc g^j^^ ^ ^ C2"-2fc, ( 12^)) is then easily seen to be 

equivalent to (the dot abbreviates dt) 

(30) i = Re{e-''^Y{e'^I,e'^J)} , ^ = u + \m{e-''^diixg{di)Y{e''^I ,e'^ J)}. 

(31) j = Re{e-*^Z(e*^J,e^^J)} , ^ = Im{e~^^diag(9j)Z(e^'^J, e^^ J)}. 

Let us assume that e := |/(0)| -C 1, and | J(0)| < O(e^). We then introduce rescaled variables 
/ — > eJ and J e^J. 

Analyticity of Y{y, z) and Z{y, z) with respect to {y, z) implies that the power series 
expansion with respect to e^'^I and e*^J converges for e sufficiently small. Accordingly, ( I3U|) 
and ( EH) yield 

(32) F = el"^l+2lPl-i^^^(j^ j)ei((m,0>-0.)e^(p,^?> 

|m,| + |p|>2 

(33) r = e'"'+''^'"'G'^p(/,^)e'^'"''^^e'<'''^> 

|m| + |p|>2 

(34) = UJr+ Yl e'"'+'''''"'$r;mp(/, J)e'(<'"'^>-^'-^e'<P'''> 

m| + |p|>2 

(35) = Y el"^l+2|^l-2e,;^p(/,J)e^<™'*>e^<^''^>, 

|m,| + |p|>2 

where we introduced the multiindices m G Z^'^ and J9 G with |m| := ^ |mr| and 

\p\ '■— Yl \Ps\- In this Fourier expansion with respect to the 27r-periodic angular variables (p 
and 6, every Fourier coefficient labeled by a pair of indices (m, p), is a homogenous polynomial 
of degree |m| in J, and of degree \p\ in J. 

If the components of u are all mutually rationally independent, one may consider the 
averaged quantities /<(</>) ft '■= {'^'^) "'Jjnd(t>ft{(p). From (EHl), Y{y,z) and Z{y,z) are 
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0(|y|), thus their power series involve terms e*^"*''^^ with \m\ > 1, but none with \m\ = 0. 
Averaging ( IH^ ~ ( with respect to thus gives 

(36) I = e^F{I,J,e) , J = , 1 = 

for some function F, where the bars account for averaged variables. Thus, if we in addition 
assume that there exists a local degenerate almost Lyapunov function with respect to £ge„nf/, 
it follows for the averaged equations of motion that |/| is bounded for all t. In particular, if 
the incommensurability condition holds, \J\ is then also bounded for all t, and a (respectively 
0) is, for the averaged system, stable. Based on these insights, and on intuition stemming 
from KAM and Nekhoroshev theory, it is thus natural to conjecture the following stability 
criterion. 

Conjecture 3.1. Let €i C €gen be a connectivity component of the critical manifold, and 
let a G €i, with spec{DX^{a)} \ {0} = {iuJi, . . . , iuj2k}, and c<jj G M \ {0} for i = 1, . . . , 2k. 
Assume that (1) the frequencies oOr are rationally independent, and (2) that there exists a local 
degenerate, almost Lyapunov function with respect to Cjnf/(a), in the sense of definition \3.1i 
Then, a is stable in the sense of Nekhoroshev. In particular, condition (2) is always satisfied 
(by the Hamiltonian H) if the index of (ti is ^{^i) = 0. 

3.2. The Relationship to Sub-Riemannan Geometry. To elucidate the geometric na- 
ture of the requirement of rationally independent frequencies, we will now approach the 
discussion of critical stability from a different point of view. This discussion involves issues 
that are central to sub-Riemannian geometry, ^ ITHl IT^ I34j . 

We study the time evolution map in a tubular e-vicinity of U{a) fl C^en by invoking a 
geometrically invariant Lie series that is adapted to the elements of the flag of V. By an 
asymptotic analysis, we explain the mechanism by which an instability can arise. The reason 
is that if the eigenfrequencies of the linear problem are not incommensurable, the problem 
of small divisors appears. This picture seems to be familiar from the perturbation theory of 
integrable Hamiltonian systems, but we note once more that the lack of integrability here 
originates from the non-holonomy of the constraints. A rigorous treatment of this last part 
of the analysis is beyond our current scope, and left for future work. 

3.2.1. Dynamics Along the Flag of V. Let U denote a small open neighborhood U of a G €, 
and assume that €.gen := C fl f/ satisfies the genericity condition of theorem 12. II 
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Lemma 3.1. Let (tgen = ^Hf/ have the genericity property formulated in theorem \2.1\ Then, 
there exists e > such that every point x eU with dji{x, ^gen) < € is given by 

X = expg Y{a) , |s| < e 

for some Y G r(V") with \\Y\\gj^,j- < 1, a G €gen fe^Ps ^ denotes the 1-parameter group of 
diffeomorphisms generated by Y , with expg Y =id). 

Proof. We choose a spanning family {Yi G Tiy)}^^ of V, with Hl^iHg^^ = 1. If for all a G C^en, 
Ta^gen Contains no subspace of Va, then 

exp^{tiYi + --- + t2kY2k){<tgen) H U 

is an open tubular neighborhood of C^en in U, for tj G (— e, e). Because the normal space 
Na^gen IS dual to the span of the 1-forms dFi at a, this condition is satisfied if and only if the 
matrix [dFjiYi)] = [Yi(Yj{H))] is invertible everywhere on Cgen- According to proposition 12 .If 
this condition is indeed fulfilled. □ 

Hence, there is an element Y G r(V") with ||l^||gjv/ ^ 1) so that x = for some 

< e < 1. Since a G ^gen^ it is clear that under the flow generated by X^, $±t(ct) — a, thus 
the solution of ( ^ with initial condition x is given by 

^l{a) := o ^.(a) = {^t o o ^-t) (a) . 

is, in particular, the 1-parameter group of diffeomorphisms with respect to the variable e 
that is generated by the pushforward vector field 

(37) Yt{x) := ^t.Y{x) = d^t oY{^_t{x)) , 

where d^t denotes the tangent map associated to From the group property Yg+t = ^s*Yt 
follows that 

(38) dtYt = dsl^,^,,Yt = , 
everywhere in U. 

Next, we pick a local spanning family {Yi G T(V)}f^^ for V that satisfies uj{Yi, Yj) = Jij, 
with J ■= If ^' Furthermore, defining 9i{-) := u!(Yi, ■), vry = J^^Yi ® 9j, where J*-' 

are the components of J^^ = — J. In particular, 

X]; = Mx^) = - y^{H) Y, 

in the basis {Yi\l^^. 
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The following proposition characterizes the orbit emanating from x in terms of nested 
commutators with respect to Yf. 



Proposition 3.2. Let f,Fi E C°°{U), where Fi := Yi{H), i = 1, . . . ,2k, and assume that 
Fi(^f*(a)), fi^lia)) are real analytic m e. For X,Y E T{TM), let 

CyX = [Y,...,[Y,X]] 

denote the r-fold iterated Lie derivative. Then, for sufficiently small e, 

(39) dJi^M)) = -F,(vI/*(a))J^'^5^^(/:;,n)(/ovI;*)(a) . 

r>0 

Proof. Clearly, 

djma)) = XUf)ma)) 

= -F,{¥M))r'Y,{f){¥,{a)) 

(40) = -F,{^l{a))r'{¥„Y,){fo¥^){a). 

Using the Lie series \E'* ^Yk = J2r Tl^rXk, we arrive at the assertion. □ 

Proposition 3.3. Assume that Yt=o E r(V), and let {Yj^f' he the given local spanning 
family ofV. Then, CyYj E r(Vi), where Vi is the i-th flag element ofV. 

Proof. Since $f * : T{V) r(^), Yt is a section of V for all t if it is for t = 0. The claim 
immediately follows from the definition of the fiag of V. □ 

Proposition 13.31 implies that there are functions a*(t, ■) E C°°{U), i = 1, . . . , 2k, so that 
Yt{x) = a''{t,x)Yi. The next proposition determines their time evolution. 

Proposition 3.4. Let Yt=o = Og^j define the initial condition, and introduce the matrix 

:= [Yi{F,){x)J''^] . 

Then, pointwise in x, 

(41) a™(t,x) = (exp(-t^],))74 + F,(x)i^^(^,a;)4, 
where 

RT^t, x) := Pr" ^ ds{exp{-{t - s)fi,))^^ ([Yi, <I,,F,], y„ 
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Proof. The initial condition at t = is given by Yq = Q-o^i, that is, by a*(0,x) = Oq. Thus, 
by the definition of Yf in ( IH7|). one has Yf = a^^t^^Yi, so that a'^{t,x)Yi = a^^t ^,Yi. From 

^O^iy^j) ~ Jiji Jik — ~Jki Slid JimJ^^ = 

Now, taking the t-derivative on both sides of the equahty sign, one finds 

= -a\t,x)Yi{Fj){x)J^'^ 

-d^F,{x)Pj^^uj([Yi,^t.Y,],Yu) . 

Using the variation of constants formula pointwise in x, one arrives at the assertion. □ 

3.2.2. N on-holonomy and small divisors. Using the description of the dynamics in the vicin- 
ity of a derived above, we will here use the small parameter e for an asymptotic expansion. 
The intention of this part of the discussion, which is not rigorous, is to explain the geometric 
origin of the incommensurability condition on frequencies in conjecture 13.11 
We consider the following simplified situation: 

(1) VLx = 0,, constant for all x in U. 

(2) specjfi} = {iuji, . . . , iuj2k}, with Ur G R. 

(3) := sup^ \uJr\ ^ 7- 

Let us briefiy comment on the generic properties of {ujr}. Writing Q = JA, we de- 
compose the matrix A = [Yi{Yj{H)){a)] into its symmetric and antisymmetric parts A+ and 
y4_, respectively. A_ = [[1^-, Yi\{H){a)]/2 vanishes if V is integrable, which one deduces from 
^nla £ for all a G *tgen, and the Frobenius condition. The linear system a = JAj^a is 
Hamiltonian, hence the spectrum of JAj^ consists of complex conjugate pairs of eigenvalues 
in iM. if it is purely imaginary (here, a := (a^, . . . , a^'^)). Considering JA_ as a perturbation 
of we may generically assume that all frequencies uj^ are distinct from one another, 

and that there are both negative and positive frequencies. 

Under the simplifying assumptions at hand, let us compute ( HH) to leading order in e. 
From ( , one infers 

Yt = a^o(exp(-^^]))}y, + Y^o{\xm , 

i 
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since = 0(|2;|) = 0(e), which follows from Fj{a) = 0. Thus, 

i i 

for all X G r(TM), and x G U^{a). Assuming that all objects in question are C°°, iterating 
the Lie bracket Cyt f times produces 

r 

m=l 

plus a series of terms with less than r nested Lie commutators that contribute to higher 
order corrections. 

Let us, for the discussion of the leading order terms along each flag element of V , omit 
the relative errors of order 0(e). By the assumption of smoothness, our considerations are 
valid for t < 0(e^^). Let us consider the term 

(42) F,{^l{a))J^' {CyY,){f o^>\){a) , 
for fixed r. It is easy to see that 

(43) F,(vl/*(a)) = F,(F,)(a) + 0(e2), 
due to -Fj(a) = 0. Therefore, 

(44) F,{^\{a))J^' = eexp {-tn)'; a^nt + 0{e') , 

from a straightforward calculation. 

Hence, the terms with r nested commutators in ( are 

^a^exp(-t^])l^. ( n air (eM-tn))'- ) [Y,„ [. . . , [F,,, F,] • ■ ■ ]](/)(a) 

m=l 

+0(e^+2) , 

as long as dzst/j(^*(a), a) < 0(e). This implies that for / G C°°{U), 

r>0 ^' •'^ 

r 

(45) ( n < i^M-s ) [^n, • • • , [y^r. Yi] ■ ■ ■](/)(«) , 

m=l 

up to relative errors of higher order in e for every fixed r. 

If / is chosen as the i-th. coordinate function x\ so that /(\l/*(a)) = x\, the quantity 
[Fj^, . . . , [Fj,., Yi\ - ■ ■ ](/)(a) is the i-th coordinate of the vector field defined by the brackets 
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at a. Consequently, ( ESI) is the component decomposition of relative to the flag of V at 
a, to leading order in e. 

By the given simplifying assumptions, spec{f2} C z]R\ {0}, and the norm of exp(— 
is 1, independently of s. Consequently, the integrand of is bounded for all s. It follows 
that if the r-th integral in the sum diverges, it will become apparent only for t > O(^). 
This would correspond to an instability along the direction of the flag element V^. While 
the leading term with r = is bounded for all t, terms with r > can diverge. 

We next write 

2k 

(46) a(s) = exp(— si7)ag = ^a^g exp^—iugs) , 

g=l 

where {e^} is an orthonormal eigenbasis of Q with respect to the standard scalar product in 
C^'^, and specjfi} = {iujg}. The amplitudes Ag G C are determined by the initial condition 
a*(t = 0) = Oq, which we assume to be nonzero. By linear recombination of the vector fields 
Yi, one can set = 6i^g. Then, ( 1^ can be written as 

r>0 l;ii,...,ir 

where 

/•t 

-'^ m=l 

,48) ^ ^^f' (n^„)(e-.--^EU....-i). 

We note that the sum of frequencies (of generically indefinite signs) in the denominator on 
the last line raises the problem of small divisors. We also remark that evidently, the nested 
commutators vanish if all indices ^l, . . . , v?^ have equal values, as it should be (otherwise, 
the solutions would always diverge). 

3.2.3. Rational frequency dependence and blow-up of solutions. Let us next discuss the situ- 
ation in which the small divisors approach zero. To this end, we introduce the set 

(49) aW(t) := {Ii,,,...,M}Z=i \ {hi,..At)}ti ^ 

which we endow with the norm || J*^''^(t)|| := supj(()g3{r)(() 1-^(^)1? and let ||y4|| := supj^j^ 2j!j{|y4j|}, 
where Ai are C-valued amplitudes. 
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Furthermore, let 2t := {ui, . . . , uj2k}, and let 

(50) := 21+ ■ ■ ■ + 21 , 

r times 

denote its r-fold sumset, which is the set containing all sums of r elements of 2t. 
For two sets of real numbers 21 and 03, we define 

(51) c/(2l,Q3) := inf ||ai G 21 , 6^- G Osj . 
Then, it follows from (HHI) that if ^(21^, -21) > 0, 

(52) ||j(")(t)|| < c/(2t„-2l)-ir]| 



(the sum over frequencies Ylm=i^jm ( EHI) is an element of 21^, and can only equal —ui 
if (i(2lr, —21) = 0). However, if d{%r, —21) = 0, there is a tuple of indices . . . , ir} such 
that 

r 

(53) /;;,,,...,i,(t)=-t^iAn^i™' 

m=l 

m case of which p(''H^)|| ~ t, that is, a divergence linear in t for large t (recalling that the 
present asymptotic considerations require t < e~^). Only if there are simultaneously positive 
and negative frequencies, d(JHr, —21) = is possible, but due to the remark at the beginning 
of subsection 13.2.21 this situation must generically assumed to be given. 

As an illustration, the following picture holds for r < 2. The fact that for r = 0, 
||J*^°^(t)|| is bounded for all t is clear. For r = 1, the first flag element Vi = [V,V] is in 
question. The condition for the emergence of a divergence is that (i(2t, — 2t) = 0. This is 
precisely given if there is a pair of frequencies icuj of equal modulus, but opposite sign. For 
r = 2, assuming that (i(2l, —21) > 0, the condition (i(2l2, —21) = implies that there is a 
triple of frequencies such that + = —uji^, ij G {1, . . . , 2k}. If this occurs, the solution 
will diverge in the direction of the second flag element, V2 = [V, [V, V]]. The discussion for 
r > 2 continues in the same manner. 

Hence, our conclusion from this asymptotic analysis is that if (i(2lr, —21) = for some 
r, then ||aW(t)|| = 0{t) for t 00. 

The physical insight gained from the above discussion can be summarized as follows. 
If the frequencies of the linearized problem fail to satisfy the incommensurability condition 
(i(2t,., —21) > for all r, the equilibrium a is unstable. However, the time required for an 
orbit to exit from a Riemannian e-neighborhood U^{a) is very large. In fact, assuming that 
(i(2l,., —21) = for some r < deg(l^) (the degree of non-holonomy of l^), a time t ~ 0{^) 
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is necessary to exit from Ue{a) in the direction of the flag element Vr (due to the factor ^ 
in (lini)). We note that the orbit does not drift out from Ue{a) fl Cgen in the direction of Va 
owing to the existence of a local degenerate Lyapunov function required in the conjectured 
stability criterion. Therefore, this discussion suggests that the incommensurability condition 
imposed on the frequencies of the linearized system can indeed not be omitted. 

3.2.4. Instabilities in the Context of Carnot-Caratheodory Geometry. The constrained Hamil- 
tonian system {M,uj, H,V) shares many characteristics with systems typically encountered 
in sub-Riemannian geometry [3 El 01] • The natural metric structure in this context 
is given by the Carnot-Caratheodory distance function distc-c induced by the Riemannian 
metric g. It assigns to a pair of points x,y & M the length of the shortest ^-horizontal 
^f-geodesic. 

If V satisfies the Chow condition, distc-c{x,y) is finite for all x,y G M, by the 
Rashevsky-Chow theorem [21^^. In this case, the Carnot-Caratheodory e-ball 

B^-^{a) := G M distc-c{x,a) < e| 

is open in M. 

If V fails to satisfy the Chow condition, pairs of points that cannot be joined by 
l^- horizontal (7j\/-geodesics are assigned a Carnot-Caratheodory distance oo. Then, M is 
locally foliated into submanifolds Nx of dimension (2?2 — rankVdeg{y)) (we recall that degiV) 
denotes the degree of non-holonomy of V), with A in some index set, which are integral 
manifolds of the (necessarily integrable) final element Vdeg(v) of the flag of V. On every 
A^A, the distribution Vx := jlV satisfies the Chow condition, where jx : Nx ^ M is the 
inclusion. Therefore, all points x, ?/ G Nx have a finite distance with respect to the Carnot- 
Caratheodory metric induced by the Riemannian metric JxOm- Every leaf A^a is an invariant 
manifold of the flow 

Let {^j^j^l^i^^ denote a local spanning family of TM such that {Yi^} spans the flag 
element V^. Let the (yf-length of all l^i^'s be 1. Then, we define the 'quenched' box 

dcg{V) dimVr 

Box,(x) := jexpi ( ^ e^' ^ t,^ F,,)(a;) | t,^ G (-1,1)} 

r=l ir=l 

in A^A, where A is suitably picked so that x G A^a- Evidently, if V satisfies Chow's condition, 
A^A = M. According to the ball-box theorem jSlE]) there are constants C > c > 0, such 
that 

BoXc,(x) C Bf~^{x) C Boxce{x) . 
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Carnot-Caratheodory e-balls can be approximated by quenched boxes in Riemannian geom- 
etry. 

The above perturbative results imply that if there is some r < deg{V), for which 
d{%., —21) = 0, the flow l>t blows up the quenched boxes, and thus the Carnot-Caratheodory 
e-ball around a G Cgen, linearly in t, and along the direction of Vr- In fact, B^~^{a) is 
widened along Vr at a rate linear in t. For t = O(^), $t maps the Carnot-Caratheodory e-ball 
containing the initial condition to a Carnot-Caratheodory ball of radius 0(1). Thus, in the 
context of Carnot-Caratheodory geometry, these instabilities, which have no counterpart in 
systems with integrable constraints, are far more significant than in the Riemannian picture. 

4. AUTONOMOUS NON-HOLONOMIC SYSTEMS IN CLASSICAL 

MECHANICS 

In this main section, we focus on the analysis of non-holonomic mechanical systems, 
and their relationship to the constrained Hamiltonian systems considered previously. The 
discussion is restricted to linear non-holonomic, Pfaffian constraints. 

Let {Q,g,U) be a Hamiltonian mechanical system, where Q is a smooth Riemannian 
n-manifold with a metric tensor g, and where U G C°°{Q) denotes the potential energy. 
No gyroscopic forces are taken into consideration. Let g* denote the induced Riemannian 
metric on the cotangent bundle T*Q. For X G r(TM), let 6x be the 1-form defined by 
0x(Y) = g{X,Y) for all Y G T(TQ). Clearly, g(X,Y) = g*(ex,9Y) for all X,Y e T(TQ). 

The Kahler metric of the previous discussion, also denoted by g, will not appear in this 
section. From here on, g will denote the Riemannian metric on Q, which should not give 
rise to any confusion. 

In a local trivialization of T*Q, a point x G T*Q is represented by a tuple {q^,Pj), 
where are coordinates on Q, and are fibre coordinates in T*Q, with i, j = 1, . . . ,n. The 
natural symplectic 2-form associated to T*Q, can be written in coordinates as 

cuq = dq^ A dpi — —dO^ . 

i 

6*0 = Pidq'' is referred to as the symplectic 1-form. 

We will only consider Hamiltonians of the form 

(54) H{q,p) = ^g;{p,p) + U{q). 
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In local bundle coordinates, the corresponding Hamiltonian vector field Xh is given by 

i 

The orbits of the associated Hamiltonian fiow $t satisfy 

(55) q' = dp^H{q,p) , pj = ~dg,H{q,p) . 

The superscript dot abbreviates dt, and will be used throughout the discussion. 

Let Aj denote the space of smooth curves j : I G M ^ T*Q, with I compact and 
connected, and let t denote a coordinate on R. The basis one form dt defines a measure on 
M. The action functional is defined by I : Aj M, 

(56) X[7] = J^dt (7*^0 - Ho^) 

= j^dt (J2p^itmt)-H{q{t),p{t))) , 

with 7 = J2i'i^^q' + Pi^Pt)- Denoting the base point projection by vr : T*Q — > Q, let 
c := (tt o 7) : / — > Q denote the projection of 7 to Q. We assume that ||c(/)|| is sufficiently 
small so that solutions of f I55|) exist, which connect the end points c{dl). Among all curves 
J : I —>■ T*Q with fixed projected endpoints c{dl), the ones that extremize X are physical 
orbits of the system. 

4.1. Linear Non-Holonomic Constraints. Let us next impose linear, 'Pfaffian' con- 
straints on the Hamiltonian mechanical system {Q, g, U), by adding a rank k distribution W 
over Q to the existing data, and invoke the Holder variational principle, [3.^, that generates 
the correct physical fiow on T*Q. The orbits of the resulting constrained dynamical system 
possess W^-horizontal projections to Q. 

We introduce the ^f-symmetric projection tensor associated to W given by pw = Pw '■ 
TQ TQ, with 

Ker(pvK) = , pwiX) = X VX G r(Tg) , 

and its orthogonal complement pw = 1 — Pw- We note that in local coordinates, pw is 
represented hj a n x n matrix of rank k. The dual of W, denoted by W*, is defined as 
the image of W under the isomorphism g : TQ T*Q, and likewise for (W*)-^ '■= 9 ° W* . 
The corresponding (7*-orthogonal projection tensors on T*Q are denoted by p\^ and 
respectively. Our inspiration to introduce pw and pw for this analysis stems from [TT] . 
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4.1.1. Dynamics of the Constrained Mechanical System. Next, we derive the equations of 
motion of the constrained mechanical system from the Holder variational principle. For a 
closely related approach to the Lagrangian theory of constrained mechanical systems, cf. 

Definition 4.1. A projective W -horizontal curve in T*Q is an embedding 7 : / C M ^ T*Q 
whose image c = tt o 7 under base point projection 7c : T*Q Q is tangent to W. 

Let 7s : / — T*Q, with s G [0, 1], be a smooth one parameter family of curves for 
which the end points Cs{dl) are independent of s (where Cg := n o 7^). 

Definition 4.2. A W -horizontal variation of a projective W -horizontal curve 'y is a smooth 
one parameter family 7., : M ^ T*Q, with s E [0, 1], for which ^(tt o 7^) is tangent to W , 
and 7o = 7. 

Let 

4>\t) := ds\^^Qq\s,t) , Xk{t) := ds\^^QPk{s,t) . 

To any ly-horizontal variation 7^ of a ly-horizontal curve 70 with fixed projections of the 
boundaries 

(57) (7ro7,)(9/) = (7ro7o)(9/), 
so that 0*1 a/ = 0, we associate the action functional 

Als] = {^Pr{sM{s,t) - H{q{s,t),p{s,t))) dt . 

Definition 4.3. (Holder principle) A physical orbit of the constrained mechanical system 
{Q, g,U,W) is a projective W -horizontal curve 70 : / ^ T*Q that extremizes l^ls] among 
all W -horizontal variations 7^ which satisfy (\51^. 

Hence, if 

(58) (5X[7.] = $^P.0ia/ + y|"$^((p.-9,,i/)0^-(g^ + 9,^i/)x.) = 

for all horizontal variations of 70 that satisfy (jf\di = 0, then 70 is a physical orbit. 
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Theorem 4.1. In the given local bundle chart, the Euler- Lagrange equations of the Holder 
variational principle are the differential-algebraic relations 

(59) q = pw{q)dpH{q,p) 

(60) Pw{q)p = -Pwi.^)dqH{q,p) 

(61) pwiq)dpH{q,p) = 0. 

Proof. The boundary term vanishes due to (f)^\di = 0. 
For any fixed value of t, one can write 0(t) as 

k 
a=l 

where Yq, is a (jf-orthonormal family of vector fields over c(/) that spans Wc{i)- Furthermore, 
fa G C°°{c{I)) are test functions obeying the boundary condition fa{c{dl)) = 0. 

Since and x ^i-re arbitrary, the terms in ( ISHj) that are contracted with 0, and the 
ones that are contracted with x vanish independently. In case of (p, one finds 

^ dtfa {p + d,H),Y: =0 

for all test functions fa- Thus, (p + dqH)iY^ = for all a = l,...,k, or equivalently, 
PwiP + ^qH) — 0> which proves f lUUj) . 

Since 7o is l^-horizontal, pwil)'! = 0, so the x-dependent term in 5X[7s] gives 

j^dt (q- pwdpHy {plyx)i + J^dt {pwdpHy {pl,x)i = Q ■ 

The components of x in the images of p\y{q) and ply{q) can be varied independently. Thus, 
both terms on the second line must vanish separately, as a consequence of which one obtains 
(EHI) and (EH). □ 

Definition 4.4. The smooth submanifold 

S := {{q,p)\pw{q)dpH{q,p) = o} C T*Q 
locally characterized by f \()l]) is called the physical leaf. 

S contains all physical orbits of the system, that is, all smooth paths 7 : M — iS C T*Q 
that satisfy the differential-algebraic relations of theorem 14.11 

Theorem 4.2. Let H be of the form ( \54\j- Then, there exists a unique physical orbit 
7 : ]R+ S with 7(0) = X for every x E S. 
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Proof. We cover S with local bundle charts of T*Q with coordinates {q,p)- For the Hamil- 
tonian ( |33)) , ( IHTj) reduces to 

pw{q)9'\q)p = 9~^{q) pwiQ) p = 0, 

where one uses the ^f-orthogonality of pw- Hence, (EH) is equivalent to p|y(q')p = 0. Since 
S is the common zero level set of the n component functions iplv{q)p)i, every section 

X = v\q,p)dqr + Ws{q,p)dp^ 

of TS is annihilated by the 1-forms 

d{pwP)t = dgr {p\yp)idq'' + dpXpwP)idPs 

for z = 1, . . . , n (of which only n — k are linearly independent), on S. 
This is expressed by 

= {V''dqr)p^^p + (Ws^pJpJ^P 

= (^''^gOPw/P + Pw^ , 

which shows that the components f of X determine the projection Py^w. Hence, the com- 
ponents V and p\yW suffice to uniquely reconstruct X. Consequently, the right hand sides of 
( En} and ( inO)) determine a unique section X of TS, so that every curve 7 : S, with 

arbitrary 7(0) G S, that satisfies dt'-yit) = X(7(t)) automatically fulfills ( 13^ ~ (EH)- This 
proves the assertion. □ 

4.1.2. Equilibria. The constrained Hamiltonian mechanical system {Q, g,U,W) possesses 
(62) (^Q := {q e Q\pUQ)dM(l) = 0} 

as its critical set. An application of Sard's theorem fully analogous to the proof of theorem l2.1l 
shows that generically, this is a piecewise smooth, ra — fc- dimensional submanifold of Q, (recall 
that the rank of pw{q) is k). 

4.1.3. Symmetries. Let G be a Lie group, and let ip : G Diff(Q), h ^ iph with '^e = id, 
denote a group action. The constrained Hamiltonian mechanical system {Q,g, U, W) is said 
to exhibit a G-symmetry if the following hold. (1) Invariance of the Riemannian metrics: 
goipi^ = g and g* oiph = 9* for all h E G. (2) Invariance of the potential energy: U oipf^ = U 
for all h e G. (3) Invariance of the distributions: iph*W = W and i^lW* = W* for all h e G. 
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4.2. Construction of the Auxiliary Extension. We are now prepared to embed the non- 
holonomic mechanical system into a constrained Hamiltonian system of the type considered 
in the previous sections. 

To this end, we will introduce a set of generalized Dirac constraints over the symplectic 
manifold {T*Q,uJo) in the way presented in section ^ They define a symplectic distribution 
V, in a manner that the constrained Hamiltonian system {T*Q,ujq, H,V), with H given by 
( I54j) ■ contains the constrained mechanical system as a dynamical subsystem. Thus, the 
auxiliary constrained Hamiltonian system {T*Q, uq, H, V) extends the mechanical system in 
the sense announced in the introduction. An early inspiration for this construction stems 
from [22] • We require the following properties to be satisfied by (T*Q, uq, H, V). 

i. S is an invariant manifold under the flow $t generated by (EJ. 

ii. All orbits ^(x) with initial conditions x & S satisfy the Euler-Lagrange equa- 
tions of the Holder principle. 

iii. S is marginally stable under 

iv. The critical set €. of $f is a vector bundle over (Eq, hence equilibria of the 
constrained mechanical system are obtained from equilibria of the extension 
by base point projection. 

V. Symmetries of the constrained mechanical system extend to those of 

Let us briefly comment on (iii) ~ (v). (iii) is of importance for numerical simulations 
of the mechanical system, (iv) makes it easy to extract information about the behaviour 
of the mechanical system from solutions of the auxiliary system. Condition (v) allows to 
apply reduction theory to the auxiliary system, in order to reduce the constrained mechanical 
system by a group action, if present. The choice for V is by no means unique, and depending 
on the speciflc problem at hand, other conditions than (iii) ~ (v) might be more useful. 

4.2.1. Construction ofV. Guided by the above requirements, we shall now construct V. 
To this end, we pick a smooth, 5f*-orthonormal family of 1-forms {C/}/=i with 

Ci = Cikiq) dq^ , 

so that locally, 

({Cl,...,Cn-fc}) = (W^y . 

The deflning relationship p\^{q)p = for S is equivalent to the condition 
(63) fi{q,p)-=9;{p,Ci{q)) = V/ = l,...,n-A;. 
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It is clear that // G C°°(T*Q). 

(1) To satisfy conditions (i) and (iii), we require that the level surfaces 

(64) M^:= {{q,p)\fj{q,p) = = l,...,n-k}, 

with /i := (/ii, . . . ,fin-k), are integral manifolds of Vdeg(y)- Here, deg(V^) denotes the 
degree of non-holonomy of V, and evidently, Aio = <S. 
Condition (iii) is satisfied because 

is an integral of motion for orbits of $t. Since L grows monotonically with increasing 
and attains its (degenerate) minimum of value zero on S, it is a Lyapunov function 
for S. Anything better than marginal stability is prohibited by energy conservation. 

(2) To satisfy condition (ii), we demand that Pw{<i)q = 0, or equivalently, that 

(65) Cj{q) = , V/ = l,...,n-A;, 

shall be satisfied along all orbits {q(t),p(t)) of owing to f I^Uj). 

(3) If the constrained mechanical system exhibits a G-symmetry, characterized by a group 
action ip : G ^ Diff(Q) so that iph^W = W Wh e G, the local family of 1-forms 

can be picked in a manner that iphCi ~ C/ is satisfied for all /i G G in a vicinity of the 
unit element e. Consequently, the functions fi{q,p) = h*{(j,p) and their level sets 
Aifj, are invariant under the group action. 

The condition that ( are integral manifolds of Vdeg(y) 3 V implies that all sections 
of V are annihilated by the 1-forms dfj, for / = 1, . . . ,n — k. Furthermore, the condition 
( requires V to be annihilated by the 1-forms 

(66) := Ciriq) dq' + ^ dps 

s 

that are obtained from lifting (j to T*{T*Q), with I = 1, . . . ,n — k. 

Proposition 4.1. The distribution 

V := [f]keTdfj) fl (flkerO) C T{T*Q) 
I I 

is symplectic. 

41 



Proof. V is symplectic iff its symplectic complement V-^ is. With the given data, the latter 
condition is more convenient to check. V-^ is locally spanned by the vector fields {Yi, . . . , Y2k) 
obtained from 

(67) uoiYi,-) = , u;{Yj+k,-) = dfi{-), 

where I = 1, . . . , k, and = —dpi A dq\ 

V-^ is symplectic if and only if D := [uj{Yj, Yj)] has values in GL^{2{n — k)). 

We remark that in the present notation, capital indices range from 1 to A; if they label 
1-forms, and from 1 to 2k if they label vector fields. 

In local bundle coordinates, 

dfi = {dg.fi){q,p) dq' + Cii{q) g'^ (q) dpj , 

where Qij are the components of the metric tensor g on Q, as before. Let us introduce the 
functions E{q) := [Cji{q)] and F{q,p) := [dgj fK{q,p)], both with values in MatiR(n x (n — k)), 
which we use to assemble 

K := ) ■■ T*Q ^ MatM(2(n - k) x 2n) . 

Any component vector v : T*Q R^" that locally represents an element of T{V) satisfies 
Kv = 0. The symplectic structure c^o is locally represented by J, defined in ([7j). One can 
easily verify that the J-th row vector of the matrix KJ'~^ is the component vector of Yj. In 
conclusion, introducing the matrices 

G{q) := E\q) g-\q) E{q) 
S{q,p) := F^{q,p)g-\q)Eiq) - E^iq)g-\q)F{q,p), 



one immediately arrives at 

(68) D = KJK^ 



G 

-G S 



Since C,i has been picked a 5f*-orthonormal family of 1-forms on Q, it is clear that G{q) = In-k- 
Thus, D is invertible. This proves that V-^ is symplectic. □ 

4.2.2. Construction of the projection tensors. Next, we determine the matrix of the ujq- 
orthogonal projection tensor vry, which is associated to V, in the present bundle chart. 
Again, it is more convenient to carry out the construction for its complement first. 
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Proposition 4.2. The matrix of the ujq- orthogonal projection tensor -Ky associated to 
(considered as a tensor field that maps r{T(T*Q)) to itself, with kernel V) is given by 

Pw 



in the local bundle chart {q,p). The matrix T = T{q,p) is defined in (\7(J{). 
Proof. The proof of lemma 11.11 can be used for this proof. The inverse of ( lUHj) is 



where we recall that G{q) = In-k- The J-th column vector of the matrix KJ~^ is the 
component vector of 1/ (we have required that {Yi, . . . l2(n-fc)} spans V-^). This implies that 

Lemma 4.1. The matrix of pw in the given chart is given by 

(69) -pw{q) = g-\q) E{q) E\q) . 

Proof. The construction presently carried out for vfy can also be applied to pw One simply 
replaces V-^ by W-^, and uoq by the Riemannian metric g on Q. An easy calculation im- 
mediately produces the asserted formula. The matrix of pw is subsequently obtained from 
Pw + pw = 1- For more details, cf. [llj. □ 

Introducing 

(70) T(g,p) := E{q) F\q,p) pw{q) - p'w^q) F{q,p) E\q) , 

a straightforward calculation produces the asserted formula for vfy. □ 
Corollary 4.1. In the given bundle coordinates, the matrix of ttv is 

where T = T{q,p) is defined in (\7(]\l. 

Proof. This is obtained from vry + vfy = l2n- D 

In this chart, 7ry(x)j7' = J'iTy{x), by c<Jo-skew orthogonality of vry. 
Theorem 4.3. Let H be as in ("BJ). Then, the dynamical system locally represented by 



(71) 



q\^f pw\(d,H 
V ) K-P'w -T )\ dpH 
corresponding to the contrained Hamiltonian system {T*Q,uJo, H,V) , is an extension of the 
constrained mechanical system {Q, g, U, W). 
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Proof. By construction, S is an invariant manifold of the associated flow $t, hence ( IHTj) is 
fulfilled for all orbits of ( r7T|) with initial conditions in S. 

The equation q = p^dpH in ([7T)) obviously is ( 1^. 

Next, using the notation / := (/i, . . . , fn-k)\ 

1 = g-'p, 

and substituting ( ITOj) for T{q,p), the equation for p in ( [7T|) becomes 

Since are invariant manifolds of the flow $t generated by ( E)), dtfi{q{t),p(t)) 
vanishes along all orbits of ( 17 so that F'^'g + E'^g^^p = 0. This implies that 

(72) p = -pI d,H + EE^g~'p + • 

RecaUing that pw = g~^EE^ from f IUIJ|) . and using the fact that / = on 5, one arrives at 
( lUUj) by multiplication with from the left. □ 

4.2.3. Equilibria of the extension. The critical set of the extension constructed above is 
characterized by the following theorem. 

Theorem 4.4. The critical set of (\71\) is given by the vector bundle 

^ = U ^ (^.*)^ 

with base space (tg, cf. f \U^) . 

Proof. Let us first consider { I72j). As has been stated above, the second term on its right 
hand equals p\\/{q)p, and moreover, from (ESI), one concludes that 

The Hamiltonian ( can be decomposed into 

H{q,p) = H{q,p\^p) + ^||p{^p||J* , 
due to the 5f*-orthogonality of p|y and p|y, so that ( 17^ can be written as 

P = - Pw9qH{q, p\yP) + p\^P . 

The equilibria of ( [7T|l are therefore determined by the conditions 
Pwi<l)P = > PwiQ) dqH{q,p\yp) = 0. 
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Because H depends quadratically on p\yp, the second condition can be reduced to 

pliq) d,U{q) = 

using the first condition. Comparing this with ( lU^. the assertion follows. □ 



In particular, this fact implies that every equilibrium {qo,Po) of the extension defines 
a unique equilibrium go on £q by base point projection. 

To analyze the stability of a given equilibrium solution go ^ ^q? it is necessary to 
determine the spectrum of the linearization of Xj^ at a = (go, 0). 

A straightforward calculation along the lines of the previous discussion shows that in 
the present bundle chart, 

where 

(74) [Rjk] := [d.^UipwYj ipwYkOgsipwYr] G Mat^{n x n) . 

Furthermore, D'^JJ is the matrix of second derivatives of U . The stability discussion in the 
previous section can now straightforwardly be applied to DXY^^a). 

4.2.4. Extension of symmetries. Let us assume that the constrained mechanical system 
(Q, U, W) exhibits a G-symmetry ip : G DiS{Q). Then, we claim that it is extended by 
{T*Q, luq, H, V). To this end, we recall that the 1-forms (j satisfy iplCi for all h E G close to 
the unit. 

Via its pullback, ip induces the group action 

^ := V'* : Gx T*Q — > T*Q 

on T*Q. This group action is symplectic, that is, ^E'^cuo = for all h E G. For a proof, 
consider for instance pp. 

The 1-forms defined in flU^. satisfy \E'^0 = 0; and likewise, fio'^h = fi is satisfied 
for all h E G close to the unit. The definition of V in proposition 14. II thus implies that 

■^h*V = V 

is satisfied for all h E G. Due to the fact that u and V are both G-invariant, ny and 7fy are 
also invariant under the G-action 
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The Hamiltonian H in f I54j) is G-invariant under by assumption on the constrained 
Hamiltonian mechanical system. Thus, fulfills '^^^Xh = Xh for all h ^ G, which implies 
that X)^ = 71v{Xh) is G- invariant. 

4.3. The Topology of the Critical Manifold. Since <t is not a compact submanifold of 
T*Q, our previous results cannot be applied directly. However, owing to the vector bundle 
structure of C and T*Q, the result 

(75) A^+^Mimi/,P(C,) = ^ A^dimi/^f (T*g) + (1 + A)Q(A) 

i,p p 

still holds, where H* denotes the de Rham cohomology based on differential forms with 
compact supports. The polynomial Q{t) has non-negative integer coefficients. 

In a first step, the arguments of section |21 can be straightforwardly applied to €q. €q 
is normal hyperbolic with respect to the gradient-like fiow ipt generated by 

dtq{t) = -pwiqim,U{q{t)), 

it contains all critical points of the Morse function U, but no other conditional extrema 
of U\^Q apart from those (it is gradient-like because along all of its non-constant orbits, 
^U{t) = —g{pw^ gU, pw^ gU)\g(t) < holds, since pw is an orthogonal projection tensor 
with respect to the Riemannian metric g on Q). This can be proved by substituting M ^ Q, 
H U, TTv ^ Pw, 5'(Kahier) 9, and CC — s> £q in section 12 and by applying the arguments 
used there. Hence, letting pi denote the index of the connectivity component of Cg, 
(Uni) implies that for compact, closed Q, 

(76) A^+'^'dimi/f (Cq,) = J2 AMimi/P(g) + (1 + A)Q(A) , 

i,p p 

where Q{t) is a polynomial with non- negative integer coefficients. 

£q, being the zero section of C, is a deformation retract of and likewise, Q is a 
deformation retract of T*Q. Thus, ( I75|) follows trivially from the invariance of the de Rham 
cohomology groups under retraction, HP{€i) = HP{(tQi), Hp{T*Q) = Hp{Q). Hence, {UB 
is equivalent to 

(77) 5^ A^+^'6,(€q,) = 5^ A%(Q) + (1 + A)Q(A), 

i,p p 

where bp is the p-th Betti number. 
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Consequently, one finds J2i ^p-Mi(^Q*) — particular, for A = —1, one obtains 

i,p i 

where x denotes the Euler characteristic. 

5. Applications, Illustrations and Examples 

Let us conclude our analysis with the discussion of some simple applications and ex- 
amples. 

5.1. A Computational Application. Let us first formulate an application of our analysis 
for the computational problem of finding the equilibria in a large constrained multibody 
system. It is in this context also desirable to determine whether a given set of parameters 
and constraints implies the existence of non-generic critical points. This is due to the cir- 
cumstance that in practice, manufacturing imprecisions can have a significant effect on the 
latter. 

For large multibody systems, equilibria can realistically only be determined by numer- 
ical routines. The strategy presented in chapters |21 and 0] suggests the following method. 

If ?7 is a Morse function whose critical points are known, and if Q is compact and 
closed, it is possible to numerically construct all generic connectivity components of €q. 
This is because generic components of €q are smooth, n — /c-dimensional submanifolds of Q 
containing all critical points of U, and no other critical points of f/lcg. This information can 
be exploited to find sufficiently many points on (tq, so that a suitable interpolation routine 
enables the approximate reconstruction of an entire connectivity component. To this end, 
one chooses a vicinity of a critical point a of U, and uses a fixed point solver to determine 
neighboring zeros of \pw{(l)^gU{<l)\^, which are elements of (tq close to a. Iterating this 
procedure with the critical points found in this manner, pieces of €q of arbitrary size can be 
determined. 

If all critical points of U are a priori known, one can proceed in this manner to construct 
all connectivity components of that contain critical points of U. Then, one is guaranteed 
to have found all of the generic components of if the numerically determined connectivity 
components are closed, compact, and contain all critical points of U. 

We remark that determining the critical points of a Morse function U : Q ^ M is a 
difficult numerical task by itself. Attempting to find critical points by simulating the gradient 
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flow generated by —VgU is time costly, because the critical points deflne a thin set in M. 
Their existence, however, is of course ensured by the topology of Q. 

Another remark is that all critical points a at which D{pwV gU){a) has a reduced rank, 
are elements of the non-generic part of €q. Thus, the latter condition is an indicator for non- 
genericity. If there are such exceptional critical points in a technically relevant region of Q, 
they can be removed by a small local modification of the system parameters or constraints. 

5.2. A disc in a periodic potential, sliding on the plane. Let us consider a mechanical 
example, consisting of a thin disk of radius r and mass m on the plane M^, which is attached 
to a masslcss skate. The connecting line between the center of the disc and the contact 
point at the center of the skate with the plane is normal to the plane, precisely if the disc 
is horizontal. We assume that the disk remains horizontal during its motion, and that the 
translational motion of the disc is only possible in the direction of the skate. 

Let (xi, X2) denote the position of the center of mass of the disc with respect to some 
Euchdean coordinate system on R^, and let (f) denote the angle enclosed by the skate and 
the Xi-axis. 

The kinetic energy of this system is given by 

which defines a Riemannian metric on TQ with metric tensor 

/ m 
[gi,{<P,e,iP)]= m 
\ ^ 

Furthermore, we assume that it moves against the background of a (27rZ) "^-periodic potential 
energy 

U{xi,X2,(f)) = ^2 ^'i'i^ ~ COSXi) +C</,(1 - COS0) , 
1=1,2 

where ci, C2, and c<^ are coupling constants. 

Dividing out the translational symmetry with respect to (27rZ)^, the configuration 
manifold of this mechanical system is given by Q = [0, 27r]^ = (periodic boundary con- 
ditions). Clearly, U : — > M is a real analytic Morse function, with 8 critical points in 
the corners of [0, tt]^, while each of the remaining critical points in [0, 2n]^ is identified with 
one of the former by periodicity. Correspondingly, we will from here on consider {xi,X2) as 
coordinates on T^, that is, mod 2n. 
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The requirement that the disk shall slide in the direction of the skate is expressed by 
the non-holonomic constraint 

Xi sin (p — X2 cos = 0. 
The matrix E'^{xi,X2,4'), introduced in the proof of theorem 14.11 thus corresponds to 

E\xi,X2,(p) = (sin 0, — cos 0,0), 

so that E^g-^E = i. 

The orthoprojectors pw and pw are thus straightforwardly obtained as 

sin^ — sin cos 
a:2, 0) = | — sin0cos0 cos^ 




Pw{xi,X2^(l)) = I sin cos sin^ 



cos^ sin cos 
) cos sin^ 
1 



The critical set is given by 

= {(a^l.^2,0) (p{yVf/)(Xi,X2,0) = o| 

(where V := {d^^,d^2,d^)). Let 

^a,b ■= |(xi,X2,0) xi=a, X2 e [O, 27r] , = fej . 

Then, 

<^Q= U '^-'b. 

a,b€{0,n} 

It is trivially clear that €q contains all critical points of U. Let G €a,b, where a,b E {0, tt}. 
Noting that pw = diag(l, 0, 1) on Cq, we have 

(V ® (pl^Vt/)) (g,) = [p\v{W®\/U)pw)iqc) + Riqc) 

(c\ cos a C2 sin X2 

cos h 

Clearly, 

spec^(^V (g) (p{4,Vf/)j(gc)) = |o, Ci cosa, cosfej , 

which is, for each fixed a, 6, independent of X2. Thus, the indices of the connectivity compo- 
nents Ca^fc with respect to the gradient-like flow generated by — pj^ Vf/ are given by 

(79) /i(c:o,o) = 2 , ^(Co,.) = /i(c:^,o) = 1 , /i(c:.,.) = o , 
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and clearly, <ta,b — for all a,b E {0,7r}. Since the Betti numbers of are given by 
bo = bs = 1, bi = b2 = 3, and those of (ta,b by 6o(C,b) = = 1, &2(Cfe) = &3(C,b) = 0, 

one finds that 

VM(Ca,i,)(^a,b) = bp{Q) 

a,b 

for p = 0, . . . , 3, or explicitly, 

63-2(2:0,0) = l = b3{T') 
62-2(2:0,0) + &2-i(Co,.) + 62-1 (^.,o) = 3 = b2{T') 
bi-o{^^,n) + bi-ii€o,n) + bi-i{<t^,o) = 3 = bi{T') 

bo-oK,.) = l = bo{T'), 

in agreement with ( I77j) . 

Next, we determine the spectrum of the linearization of Xj^ at {qc, 0) G T*Q, cf. f 17^]) . 
To this end, 

/ ^ 

(pw9-'pIv){Qc)= 

and multiplying this matrix from the right with ( ITHj) yields 

ci COS g g C2 sin 3^2 
m m 

fi(g„^):= I 

Clearly, 



g g 2c0 COS b 



spec fi(gc, ^) = <^ 0, , \ 

\ Jim mr'^ ) 

From ( ITSj) , it is easy to see that 



spec(M]^(g„0)) = |o,± 



Ci cos a / 2c<i cos b 

'^V — ~ 

m V mr'^ 



hence critical stability occurs for the case a = b = n, while in all other cases, there is an 
asymptotically unstable direction. 

We conclude that all components *ta^b, where a + b < n, are unstable. In the critically 
stable case a = b = it, the linear problem is oscillatory, and the eigenfrequencies are given by 

and ■\J independently of X2. Since fi^C^^n) = 0, our discussion in section El suggests 
that the connectivity component ^Tr^n of (tg is stable in the sense of Nekhoroshev if the ratio 



is irrational. 
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